Big scholarly data: A survey

F **a, W Wang, TM Bekele, H Liu - IEEE Transactions on Big …, 2017 - ieeexplore.ieee.org
With the rapid growth of digital publishing, harvesting, managing, and analyzing scholarly
information have become increasingly challenging. The term Big Scholarly Data is coined …

Figureseer: Parsing result-figures in research papers

N Siegel, Z Horvitz, R Levin, S Divvala… - Computer Vision–ECCV …, 2016 - Springer
Abstract 'Which are the pedestrian detectors that yield a precision above 95% at 25%
recall?'Answering such a complex query involves identifying and analyzing the results …

Extracting scientific figures with distantly supervised neural networks

N Siegel, N Lourie, R Power, W Ammar - … of the 18th ACM/IEEE on joint …, 2018 - dl.acm.org
Non-textual components such as charts, diagrams and tables provide key information in
many scientific documents, but the lack of large labeled datasets has impeded the …

A survey on scholarly data: From big data perspective

S Khan, X Liu, KA Shakil, M Alam - Information Processing & Management, 2017 - Elsevier
Recently, there has been a shifting focus of organizations and governments towards
digitization of academic and technical documents, adding a new facet to the concept of …

COVID-19-CT-CXR: a freely accessible and weakly labeled chest X-ray and CT image collection on COVID-19 from biomedical literature

Y Peng, Y Tang, S Lee, Y Zhu… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
The latest threat to global health is the COVID-19 outbreak. Although there exist large
datasets of chest X-rays (CXR) and computed tomography (CT) scans, few COVID-19 image …

Citeseerx: Ai in a digital library search engine

J Wu, KM Williams, HH Chen, M Khabsa, C Caragea… - AI Magazine, 2015 - ojs.aaai.org
CiteSeerX is a digital library search engine providing access to more than five million
scholarly documents with nearly a million users and millions of hits per day. We present key …

A survey of scholarly data visualization

J Liu, T Tang, W Wang, B Xu, X Kong, F **a - Ieee Access, 2018 - ieeexplore.ieee.org
Scholarly information usually contains millions of raw data, such as authors, papers,
citations, as well as scholarly networks. With the rapid growth of the digital publishing and …

Towards building a scholarly big data platform: Challenges, lessons and opportunities

Z Wu, J Wu, M Khabsa, K Williams… - IEEE/ACM Joint …, 2014 - ieeexplore.ieee.org
We introduce a big data platform that provides various services for harvesting scholarly
information and enabling efficient scholarly applications. The core architecture of the …

Scholarly big data information extraction and integration in the CiteSeerχ digital library

K Williams, J Wu, SR Choudhury… - 2014 IEEE 30th …, 2014 - ieeexplore.ieee.org
CiteSeer χ is a digital library that contains approximately 3.5 million scholarly documents
and receives between 2 and 4 million requests per day. In addition to making documents …

Figure metadata extraction from digital documents

SR Choudhury, P Mitra, A Kirk, S Szep… - 2013 12th …, 2013 - ieeexplore.ieee.org
Academic papers contain multiple figures (information graphics) representing important
findings and experimental results. Automatic data extraction from such figures and …