The determination of cluster number at k-mean using elbow method and purity evaluation on headline news

D Marutho, SH Handaka, E Wijaya - … international seminar on …, 2018 - ieeexplore.ieee.org
Information is one of the most important thing in our lives, while humans is naturally
impatient when searching for information from the internet. Users want to get the right …

Deep learning-based extraction of construction procedural constraints from construction regulations

B Zhong, X **ng, H Luo, Q Zhou, H Li, T Rose… - Advanced Engineering …, 2020 - Elsevier
Construction procedural constraints are critical in facilitating effective construction procedure
checking in practice and for various inspection systems. Nowadays, the manual extraction of …

Framework for syntactic string similarity measures

N Gali, R Mariescu-Istodor, D Hostettler… - Expert Systems with …, 2019 - Elsevier
Similarity measure is an essential component of information retrieval, document clustering,
text summarization, and question answering, among others. In this paper, we introduce a …

An efficient regular expression inference approach for relevant image extraction

HV Agun, E Uzun - Applied Soft Computing, 2023 - Elsevier
Traditional approaches for extracting relevant images automatically from web pages are
error-prone and time-consuming. To improve this task, operations such as preparing a larger …

Article segmentation in digitised newspapers with a 2d markov model

A Naoum, J Nothman, J Curran - 2019 International conference …, 2019 - ieeexplore.ieee.org
Document analysis and recognition is increasingly used to digitise collections of historical
books, newspapers and other periodicals. In the digital humanities, it is often the goal to …

H-rank: a keywords extraction method from web pages using POS tags

H Shah, MUS Khan, P Fränti - 2019 IEEE 17th International …, 2019 - ieeexplore.ieee.org
We present a new keywords extraction method that applies the semantic similarity among
the frequent words on the web page along with the distribution of POS tags. We apply …

Gene selection for enhanced classification on microarray data using a weighted k-NN based algorithm

E Ventura-Molina, A Alarcón-Paredes… - Intelligent Data …, 2019 - content.iospress.com
Feature selection is a common solution to microarray analysis. Previous approaches either
select features based on classical statistical tests that can be tuned up with a classifier, or …

Framework for location-aware search engine

A Tabarcea, N Gali, P Fränti - Journal of location Based services, 2017 - Taylor & Francis
Nowadays, a large part of the multimedia data on the Internet is generated with devices that
automatically annotate them with location information However, free-form content on …

Mopsi location-based service

P Fränti - 2024 - erepo.uef.fi
Mopsi is a location-based platform for storing photos and GPS tracks of the users. It allowed
user to share their data on-line with real-time user location, communicate with other users …

Combining statistical, structural, and linguistic features for keyword extraction from web pages

H Shah, P Fränti - 2022 - erepo.uef.fi
Keywords are commonly used to summarize text documents. In this paper, we perform a
systematic comparison of methods for automatic keyword extraction from web pages. The …