Big scholarly data: A survey
With the rapid growth of digital publishing, harvesting, managing, and analyzing scholarly
information have become increasingly challenging. The term Big Scholarly Data is coined …
information have become increasingly challenging. The term Big Scholarly Data is coined …
Figureseer: Parsing result-figures in research papers
Abstract 'Which are the pedestrian detectors that yield a precision above 95% at 25%
recall?'Answering such a complex query involves identifying and analyzing the results …
recall?'Answering such a complex query involves identifying and analyzing the results …
Extracting scientific figures with distantly supervised neural networks
Non-textual components such as charts, diagrams and tables provide key information in
many scientific documents, but the lack of large labeled datasets has impeded the …
many scientific documents, but the lack of large labeled datasets has impeded the …
A survey on scholarly data: From big data perspective
Recently, there has been a shifting focus of organizations and governments towards
digitization of academic and technical documents, adding a new facet to the concept of …
digitization of academic and technical documents, adding a new facet to the concept of …
COVID-19-CT-CXR: a freely accessible and weakly labeled chest X-ray and CT image collection on COVID-19 from biomedical literature
The latest threat to global health is the COVID-19 outbreak. Although there exist large
datasets of chest X-rays (CXR) and computed tomography (CT) scans, few COVID-19 image …
datasets of chest X-rays (CXR) and computed tomography (CT) scans, few COVID-19 image …
Citeseerx: Ai in a digital library search engine
CiteSeerX is a digital library search engine providing access to more than five million
scholarly documents with nearly a million users and millions of hits per day. We present key …
scholarly documents with nearly a million users and millions of hits per day. We present key …
A survey of scholarly data visualization
Scholarly information usually contains millions of raw data, such as authors, papers,
citations, as well as scholarly networks. With the rapid growth of the digital publishing and …
citations, as well as scholarly networks. With the rapid growth of the digital publishing and …
Towards building a scholarly big data platform: Challenges, lessons and opportunities
We introduce a big data platform that provides various services for harvesting scholarly
information and enabling efficient scholarly applications. The core architecture of the …
information and enabling efficient scholarly applications. The core architecture of the …
Scholarly big data information extraction and integration in the CiteSeerχ digital library
CiteSeer χ is a digital library that contains approximately 3.5 million scholarly documents
and receives between 2 and 4 million requests per day. In addition to making documents …
and receives between 2 and 4 million requests per day. In addition to making documents …
Figure metadata extraction from digital documents
Academic papers contain multiple figures (information graphics) representing important
findings and experimental results. Automatic data extraction from such figures and …
findings and experimental results. Automatic data extraction from such figures and …