- Academic Search

Articles

Scholar

2 results (0.02 sec)

My profile My library

Rulearena: A benchmark for rule-guided reasoning with llms in real-world scenarios

Search within citing articles

[Free GPT-4]

[PDF] arxiv.org

Antileak-bench: Preventing data contamination by automatically constructing benchmarks with updated real-world knowledge

X Wu, L Pan, Y **e, R Zhou, S Zhao, Y Ma… - arxiv preprint arxiv …, 2024 - arxiv.org

Data contamination hinders fair LLM evaluation by introducing test data into newer models'
training sets. Existing studies solve this challenge by updating benchmarks with newly …

Save Cite Cited by 3 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] ntu.edu.sg

Towards effective neural topic modeling

X Wu - 2024 - dr.ntu.edu.sg

Over the past few decades, the world has witnessed an unprecedented explosion of
information. Of these, a substantial portion consists of unlabeled textual data, such as …

Save Cite Related articles View as HTML

Create alert

Cite

Advanced search

Saved to My library

Rulearena: A benchmark for rule-guided reasoning with llms in real-world scenarios

Antileak-bench: Preventing data contamination by automatically constructing benchmarks with updated real-world knowledge

Towards effective neural topic modeling