- Academic Search

T Wu, S He, J Liu, S Sun, K Liu… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org

ChatGPT, an artificial intelligence generated content (AIGC) model developed by OpenAI,
has attracted world-wide attention for its capability of dealing with challenging language …

Uložit Citovat Počet citací tohoto článku: 1172 Související články Všechny verze (počet: 4)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Recent advances in natural language processing via large pre-trained language models: A survey

B Min, H Ross, E Sulem, APB Veyseh… - ACM Computing …, 2023 - dl.acm.org

Large, pre-trained language models (PLMs) such as BERT and GPT have drastically
changed the Natural Language Processing (NLP) field. For numerous NLP tasks …

Uložit Citovat Počet citací tohoto článku: 1141 Související články Všechny verze (počet: 5)

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Nucleotide Transformer: building and evaluating robust foundation models for human genomics

H Dalla-Torre, L Gonzalez, J Mendoza-Revilla… - Nature …, 2024 - nature.com

The prediction of molecular phenotypes from DNA sequences remains a longstanding
challenge in genomics, often driven by limited annotated data and the inability to transfer …

Uložit Citovat Počet citací tohoto článku: 170 Související články Všechny verze (počet: 4)

[Free GPT-4]
[DeepSeek]

[PDF] science.org

Evolutionary-scale prediction of atomic-level protein structure with a language model

Z Lin, H Akin, R Rao, B Hie, Z Zhu, W Lu, N Smetanin… - Science, 2023 - science.org

Recent advances in machine learning have leveraged evolutionary information in multiple
sequence alignments to predict protein structure. We demonstrate direct inference of full …

Uložit Citovat Počet citací tohoto článku: 2419 Související články Všechny verze (počet: 9)

[Free GPT-4]
[DeepSeek]

[HTML] google.com

[HTML][HTML] Modern language models refute Chomsky's approach to language

ST Piantadosi - From fieldwork to linguistic theory: A tribute to …, 2023 - books.google.com

Modern machine learning has subverted and bypassed the theoretical framework of
Chomsky's generative approach to linguistics, including its core claims to particular insights …

Uložit Citovat Počet citací tohoto článku: 192 Související články Všechny verze (počet: 3)

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Explainability for large language models: A survey

H Zhao, H Chen, F Yang, N Liu, H Deng, H Cai… - ACM Transactions on …, 2024 - dl.acm.org

Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear and this lack of …

Uložit Citovat Počet citací tohoto článku: 426 Související články Všechny verze (počet: 5)

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model

M Hanna, O Liu, A Variengien - Advances in Neural …, 2023 - proceedings.neurips.cc

Pre-trained language models can be surprisingly adept at tasks they were not explicitly
trained on, but how they implement these capabilities is poorly understood. In this paper, we …

Uložit Citovat Počet citací tohoto článku: 117 Související články Všechny verze (počet: 5) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] biorxiv.org

[PDF][PDF] Language models of protein sequences at the scale of evolution enable accurate structure prediction

Z Lin, H Akin, R Rao, B Hie, Z Zhu, W Lu… - BioRxiv, 2022 - biorxiv.org

Large language models have recently been shown to develop emergent capabilities with
scale, going beyond simple pattern matching to perform higher level reasoning and …

Uložit Citovat Počet citací tohoto článku: 582 Související články Všechny verze (počet: 3) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org

AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Uložit Citovat Počet citací tohoto článku: 4757 Související články Všechny verze (počet: 2) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Does localization inform editing? surprising differences in causality-based localization vs. knowledge editing in language models

P Hase, M Bansal, B Kim… - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract Language models learn a great quantity of factual information during pretraining,
and recent work localizes this information to specific model weights like mid-layer MLP …

Uložit Citovat Počet citací tohoto článku: 115 Související články Všechny verze (počet: 6) Zobrazit jako HTML

Citovat

Rozšířené vyhledávání

Uloženo do Mojí knihovny

A brief overview of ChatGPT: The history, status quo and potential future development

Recent advances in natural language processing via large pre-trained language models: A survey

Nucleotide Transformer: building and evaluating robust foundation models for human genomics

Evolutionary-scale prediction of atomic-level protein structure with a language model

[HTML][HTML] Modern language models refute Chomsky's approach to language

Explainability for large language models: A survey

How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model

[PDF][PDF] Language models of protein sequences at the scale of evolution enable accurate structure prediction

On the opportunities and risks of foundation models

Does localization inform editing? surprising differences in causality-based localization vs. knowledge editing in language models