A survey on scholarly data: From big data perspective

S Khan, X Liu, KA Shakil, M Alam - Information Processing & Management, 2017 - Elsevier
Recently, there has been a shifting focus of organizations and governments towards
digitization of academic and technical documents, adding a new facet to the concept of …

Visual detection with context for document layout analysis

C Soto, S Yoo - Proceedings of the 2019 Conference on Empirical …, 2019 - aclanthology.org
We present 1) a work-in-progress method to visually segment key regions of scientific
articles using an object detection technique augmented with contextual features, and 2) a …

Continuous document layout analysis: Human-in-the-loop AI-based data curation, database, and evaluation in the domain of public affairs

A Peña, A Morales, J Fierrez, J Ortega-Garcia, I Puente… - Information …, 2024 - Elsevier
In the digital era, the number of digital documents generated each day has been
increasing exponentially with the years, to a point where it is unfeasible to process them …

Sentence boundary extraction from scientific literature of electric double layer capacitor domain: tools and techniques

MSU Miah, J Sulaiman, TB Sarwar, A Naseer… - Applied Sciences, 2022 - mdpi.com
Given the growth of scientific literature on the web, particularly material science, acquiring
data precisely from the literature has become more significant. Material information systems …

Analysing the requirements for an open research knowledge graph: use cases, quality requirements, and construction strategies

A Brack, A Hoppe, M Stocker, S Auer… - International Journal on …, 2022 - Springer
Current science communication has a number of drawbacks and bottlenecks which have
been subject of discussion lately: Among others, the rising number of published articles …

An effective scholarly search by combining inverted indices and structured search with citation networks analysis

S Khalid, S Wu, A Wahid, A Alam, I Ullah - IEEE Access, 2021 - ieeexplore.ieee.org
The rapid growth in the number of scholarly documents on the Web and in other digital
platforms makes it challenging for researchers to find research publications most relevant to …

NLPExplorer: Exploring the Universe of NLP Papers

M Parmar, N Jain, P Jain, P Jayakrishna Sahit… - Advances in Information …, 2020 - Springer
Understanding the current research trends, problems, and their innovative solutions remains
a bottleneck due to the ever-increasing volume of scientific articles. In this paper, we …

Analyzing scientific publications using domain-specific word embedding and topic modelling

T Singhal, J Liu, LTM Blessing… - 2021 IEEE international …, 2021 - ieeexplore.ieee.org
The scientific world is changing at a rapid pace, with new technology being developed and
new trends being set at an increasing frequency. This paper presents a framework for …

Detecting in-line mathematical expressions in scientific documents

K Iwatsuki, T Sagara, T Hara, A Aizawa - … of the 2017 ACM symposium on …, 2017 - dl.acm.org
One of the issues in extracting natural language sentences from PDF documents is the
identification of non-textual elements in a sentence. In this paper, we report our preliminary …

MatScIE: An automated tool for the generation of databases of methods and parameters used in the computational materials science literature

S Guha, A Mullick, J Agrawal, S Ram, S Ghui… - Computational Materials …, 2021 - Elsevier
The number of published articles in the field of materials science is growing rapidly every
year. This comparatively unstructured data source, which contains a large amount of …