Ryan Dubnicek
Title
Cited by
Year
Impact of OCR quality on BERT embeddings in the domain classification of book excerpts
M Jiang, Y Hu, G Worthey, RC Dubnicek, T Underwood, JS Downie
CEUR Workshop Proceedings (http://ceur-ws.org), ISSN 1613-0073, 2021
Cited by: 20; Year: 2021
The Gutenberg-HathiTrust parallel corpus: A real-world dataset for noise investigation in uncorrected OCR texts
M Jiang, Y Hu, G Worthey, RC Dubnicek, B Capitanu, D Kudeki, ...
iSchools, 2021
Cited by: 11; Year: 2021
The HathiTrust research center extracted features dataset (2.0)
J Jett, B Capitanu, D Kudeki, T Cole, Y Hu, P Organisciak, T Underwood, ...
HathiTrust Research Center, 2020
Cited by: 10; Year: 2020
Evaluating BERT's Encoding of Intrinsic Semantic Features of OCR'd Digital Library Collections
M Jiang, Y Hu, G Worthey, RC Dubnicek, T Underwood, JS Downie
2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 308-309, 2021
Cited by: 7*; Year: 2021
"The library is open!": Open data and an open API for the HathiTrust Digital Library
JA Walsh, G Layne-Worthey, J Jett, B Capitanu, P Organisciak, ...
CHR, 703-714, 2023
Cited by: 4; Year: 2023
Uncovering black fantastic: Piloting a word feature analysis and machine learning approach for genre classification
NN Parulian, R Dubnicek, G Worthey, DJ Evans, JA Walsh, JS Downie
Proceedings of the Association for Information Science and Technology 59 (1 …, 2022
Cited by: 4; Year: 2022
A prototype Gutenberg-Hathitrust sentence-level parallel corpus for OCR error analysis: Pilot investigations
M Jiang, RC Dubnicek, G Worthey, T Underwood, JS Downie
Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries, 1-5, 2022
Cited by: 2; Year: 2022
Tuning Out the Noise: Benchmarking Entity Extraction for Digitized Native American Literature
NN Parulian, R Dubnicek, DJ Evans, Y Hu, G Layne‐Worthey, JS Downie, ...
Proceedings of the Association for Information Science and Technology 60 (1 …, 2023
Cited by: 1; Year: 2023
Bridging the information gap between structural and note-level musical datasets
Y Hu, DM Weigl, KR Page, R Dubnicek, JS Downie
iConference 2019 Proceedings, 2019
Cited by: 1; Year: 2019
Exploring linked data benefits for digital library users
K Fenlon, J Jett, R Dubnicek, TW Cole, D Kudeki
2018 ASIS&T Annual Meeting, 799-800, 2018
Cited by: 1; Year: 2018
Creating A Disability Corpus for Literary Analysis: Pilot Classification Experiments
R Dubnicek, T Underwood, JS Downie
iConference 2018 Proceedings, 2018
Cited by: 1; Year: 2018
TORCHLITE: New, Open Analytical Tools and Infrastructure for a Mega‐Scale Digital Library
M Lamba, J Walsh, R Dubnicek, J Christie, JS Downie, J Swatscheno, ...
Proceedings of the Association for Information Science and Technology 61 (1 …, 2024
2024
Updates from HathiTrust Research Center: (Some of) What We’re Working On
R Dubnicek
2023
Piloting A Machine Learning Approach to Identify English-Language Fiction in the HathiTrust Digital Library.
R Dubnicek, T Underwood
DH, 2023
2023
Uncovering the Black Fantastic: Piloting Text Similarity Methods for Finding “Lost” Genre Fiction in HathiTrust (Poster)
NN Parulian, R Dubnicek, G Layne-Worthey, S Williams, C West-White, ...
ADHO 2022-Tokyo, 2022
2022
Introduction to and Hands-On Use Cases with HathiTrust Research Center's Extracted Features 2.0 Dataset
R Dubnicek, D Kudeki
2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 352-353, 2021
2021
Extending the Utility of the HTRC Extracted Features Dataset Through Linked Data
B Capitanu, TW Cole, JS Downie, R Dubnicek, J Jett, D Kudeki
2020
Piloting a workflow for extracting author citations from Samuel Johnson's Dictionary of the English Language
J Wong, R Dubnicek
iConference 2020 Proceedings, 2020
2020
Text mining with HathiTrust
ED Koehl, R Dubnicek
2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 451-452, 2019
2019
Exploring the Benefits for Users of Linked Open Data for Digitized Special Collections, White Paper #2: Analysis of Early User Feedback
K Fenlon, J Jett, R Dubnicek, T Cole, C Szylowicz, D Kudeki, M Zavala, ...
2018