Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

A Srivastava, A Rastogi, A Rao, AAM Shoeb… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …

Identifying breakthrough scientific papers

P Savov, A Jatowt, R Nielek - Information Processing & Management, 2020 - Elsevier
Citation analysis does not tell the whole story about the innovativeness of scientific papers.
Works by prominent authors tend to receive disproportionately many citations, while …

Temporal information retrieval

N Kanhabua, A Anand - Proceedings of the 39th International ACM …, 2016 - dl.acm.org
The study of temporal dynamics and its impact can be framed within the so-called temporal
IR approaches, which explain how user behavior, document content and scale vary with …

Dating documents using graph convolution networks

S Vashishth, SS Dasgupta, SN Ray… - arXiv preprint arXiv …, 2019 - arxiv.org
Document date is essential for many important tasks, such as document retrieval,
summarization, event detection, etc. While existing approaches for these tasks assume …

Examining temporality in document classification

X Huang, M Paul - Proceedings of the 56th Annual Meeting of the …, 2018 - aclanthology.org
Many corpora span broad periods of time. Language processing models trained during one
time period may not work well in future time periods, and the best model may depend on …

BiTimeBERT: Extending pre-trained language representations with bi-temporal information

J Wang, A Jatowt, M Yoshikawa, Y Cai - Proceedings of the 46th …, 2023 - dl.acm.org
Time is an important aspect of documents and is used in a range of NLP and IR tasks. In this
work, we investigate methods for incorporating temporal information during pre-training to …

Interactive system for reasoning about document age

A Jatowt, R Campos - Proceedings of the 2017 ACM on Conference on …, 2017 - dl.acm.org
Recently, many historical texts have become digitized and made accessible for search and
browsing. Professionals who work with collections of such texts often need to verify the …

Neural graph embedding methods for natural language processing

S Vashishth - arXiv preprint arXiv:1911.03042, 2019 - arxiv.org
Knowledge graphs are structured representations of facts in a graph, where nodes represent
entities and edges represent relationships between them. Recent research has resulted in …

Generic method for detecting focus time of documents

A Jatowt, CMA Yeung, K Tanaka - Information Processing & Management, 2015 - Elsevier
Time is an important aspect of text documents. While some documents are atemporal, many
have strong temporal characteristics and contain contents related to time. Such documents …

AD3: Attentive deep document dater

SN Ray, SS Dasgupta, P Talukdar - arXiv preprint arXiv:1902.02161, 2019 - arxiv.org
Knowledge of the creation date of documents facilitates several tasks such as
summarization, event extraction, temporally focused information extraction etc …