[HTML][HTML] NLM-Gene, a richly annotated gold standard dataset for gene entities that addresses ambiguity and multi-species gene recognition

R Islamaj, CH Wei, D Cissel, N Miliaras… - Journal of biomedical …, 2021 - Elsevier
The automatic recognition of gene names and their corresponding database identifiers in
biomedical text is an important first step for many downstream text-mining applications …

PENNER: Pattern-enhanced nested named entity recognition in biomedical literature

X Wang, Y Zhang, Q Li, CH Wu… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Many biomedical entity mentions contain other entity mentions nested inside. Most current
named entity recognition (NER) systems deal with only flat entities and ignore such nested …

Pattern discovery for wide-window open information extraction in biomedical literature

Q Li, X Wang, Y Zhang, F Ling… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Open information extraction is an important task in Biomedical domain. The goal of the
OpenIE is to automatically extract structured information from unstructured text with no or …

Phosphoproteome profiling revealed abnormally phosphorylated AMPK and ATF2 involved in glucose metabolism and tumorigenesis of GH-PAs

S Zhao, J Feng, C Li, H Gao, P Lv, J Li, Q Liu… - Journal of …, 2019 - Springer
Purpose Protein phosphorylation plays a key role in tumorigenesis and progression.
However, little is known about the phosphoproteome profiles of growth hormone-secreting …

Identification of Regions Required for CDCA7 Interaction with DNA Damage Repair Machinery

SR Jaff - 2023 - yorkspace.library.yorku.ca
Abstract CDCA7 (Cell Division Cycle Associated Protein 7) is a transcription factor protein
that binds to DNA and histone modifying enzymes supporting DNA methylation and …

Methods of computational interactomics for investigating interactions of human proteoforms

EV Poverennaya, OI Kiseleva, AS Ivanov… - Biochemistry …, 2020 - Springer
Abstract Human genome contains ca. 20,000 protein-coding genes that could be translated
into millions of unique protein species (proteoforms). Proteoforms coded by a. single gene …

МЕТОДЫ ВЫЧИСЛИТЕЛЬНОЙ ИНТЕРАКТОМИКИ В ВОПРОСАХ ВЗАИМОДЕЙСТВИЯ ПРОТЕОФОРМ ЧЕЛОВЕКА

ЕВ Поверенная, ОИ Киселева, АС Иванов… - Биохимия, 2020 - elibrary.ru
Для человека известно около 20 000 белок-кодирующих генов, которые могут быть
транслированы в миллионы уникальных видов белков (протеоформ). Протеоформы …

[HTML][HTML] Identification of a novel heat shock protein 33 of Pythium insidiosum from the first Chinese skin and subcutaneous Pythiosis

H Zhang, F Zhou, K Zhang - Informatics in Medicine Unlocked, 2023 - Elsevier
Objective To predict and analyze the structure and function of heat shock protein 33 by
bioinformatics analysis. Methods The physical and chemical properties, hydrophilicity …

Text Mining and Machine Learning Protocol for Extracting Human-Related Protein Phosphorylation Information from PubMed

K Arumugam, RR Shanker - Biomedical Text Mining, 2022 - Springer
In the modern health care research, protein phosphorylation has gained an enormous
attention from the researchers across the globe and requires automated approaches to …

Scientific knowledge extraction from massive text data

X Wang - 2022 - ideals.illinois.edu
Text mining is promising for advancing human knowledge in many fields, given the rapidly
growing volume of text data (eg, news reports, scientific articles, and medical notes) we are …