The determination of cluster number at k-mean using elbow method and purity evaluation on headline news
D Marutho, SH Handaka, E Wijaya - … international seminar on …, 2018 - ieeexplore.ieee.org
Information is one of the most important thing in our lives, while humans is naturally
impatient when searching for information from the internet. Users want to get the right …
impatient when searching for information from the internet. Users want to get the right …
Deep learning-based extraction of construction procedural constraints from construction regulations
Construction procedural constraints are critical in facilitating effective construction procedure
checking in practice and for various inspection systems. Nowadays, the manual extraction of …
checking in practice and for various inspection systems. Nowadays, the manual extraction of …
Framework for syntactic string similarity measures
Similarity measure is an essential component of information retrieval, document clustering,
text summarization, and question answering, among others. In this paper, we introduce a …
text summarization, and question answering, among others. In this paper, we introduce a …
An efficient regular expression inference approach for relevant image extraction
Traditional approaches for extracting relevant images automatically from web pages are
error-prone and time-consuming. To improve this task, operations such as preparing a larger …
error-prone and time-consuming. To improve this task, operations such as preparing a larger …
Article segmentation in digitised newspapers with a 2d markov model
Document analysis and recognition is increasingly used to digitise collections of historical
books, newspapers and other periodicals. In the digital humanities, it is often the goal to …
books, newspapers and other periodicals. In the digital humanities, it is often the goal to …
H-rank: a keywords extraction method from web pages using POS tags
We present a new keywords extraction method that applies the semantic similarity among
the frequent words on the web page along with the distribution of POS tags. We apply …
the frequent words on the web page along with the distribution of POS tags. We apply …
Gene selection for enhanced classification on microarray data using a weighted k-NN based algorithm
Feature selection is a common solution to microarray analysis. Previous approaches either
select features based on classical statistical tests that can be tuned up with a classifier, or …
select features based on classical statistical tests that can be tuned up with a classifier, or …
Framework for location-aware search engine
Nowadays, a large part of the multimedia data on the Internet is generated with devices that
automatically annotate them with location information However, free-form content on …
automatically annotate them with location information However, free-form content on …
Combining statistical, structural, and linguistic features for keyword extraction from web pages
H Shah, P Fränti - 2022 - erepo.uef.fi
Keywords are commonly used to summarize text documents. In this paper, we perform a
systematic comparison of methods for automatic keyword extraction from web pages. The …
systematic comparison of methods for automatic keyword extraction from web pages. The …