A survey on scholarly data: From big data perspective
Recently, there has been a shifting focus of organizations and governments towards
digitization of academic and technical documents, adding a new facet to the concept of …
Visual detection with context for document layout analysis
We present 1) a work in progress method to visually segment key regions of scientific
articles using an object detection technique augmented with contextual features, and 2) a …
Continuous document layout analysis: Human-in-the-loop AI-based data curation, database, and evaluation in the domain of public affairs
In the digital era, the number of digital documents generated each day has been
increasing exponentially with the years, to a point where it is unfeasible to process them …
Sentence boundary extraction from scientific literature of electric double layer capacitor domain: tools and techniques
Given the growth of scientific literature on the web, particularly material science, acquiring
data precisely from the literature has become more significant. Material information systems …
Analysing the requirements for an open research knowledge graph: use cases, quality requirements, and construction strategies
Current science communication has a number of drawbacks and bottlenecks which have
been the subject of discussion lately: among others, the rising number of published articles …
An effective scholarly search by combining inverted indices and structured search with citation networks analysis
The rapid growth in the number of scholarly documents on the Web and in other digital
platforms makes it challenging for researchers to find research publications most relevant to …
NLPExplorer: Exploring the Universe of NLP Papers
Understanding the current research trends, problems, and their innovative solutions remains
a bottleneck due to the ever-increasing volume of scientific articles. In this paper, we …
Analyzing scientific publications using domain-specific word embedding and topic modelling
The scientific world is changing at a rapid pace, with new technology being developed and
new trends being set at an increasing frequency. This paper presents a framework for …
Detecting in-line mathematical expressions in scientific documents
One of the issues in extracting natural language sentences from PDF documents is the
identification of non-textual elements in a sentence. In this paper, we report our preliminary …
MatScIE: An automated tool for the generation of databases of methods and parameters used in the computational materials science literature
The number of published articles in the field of materials science is growing rapidly every
year. This comparatively unstructured data source, which contains a large amount of …