A survey on multilingual large language models: Corpora, alignment, and bias

Y Xu, L Hu, J Zhao, Z Qiu, K XU, Y Ye, H Gu - arxiv preprint arxiv …, 2024 - arxiv.org
Based on the foundation of Large Language Models (LLMs), Multilingual LLMs (MLLMs)
have been developed to address the challenges faced in multilingual natural language …

Large scale foundation models for intelligent manufacturing applications: a survey

H Zhang, SD Semujju, Z Wang, X Lv, K Xu… - Journal of Intelligent …, 2025 - Springer
Although the applications of artificial intelligence especially deep learning have greatly
improved various aspects of intelligent manufacturing, they still face challenges for broader …

Hate Speech Detection using Large Language Models: A Comprehensive Review

A Albladi, M Islam, A Das, M Bigonah, Z Zhang… - IEEE …, 2025 - ieeexplore.ieee.org
The widespread use of social media and other online platforms has facilitated
unprecedented communication and information exchange. However, it has also led to the …

On behalf of the stakeholders: Trends in nlp model interpretability in the era of llms

N Calderon, R Reichart - arxiv preprint arxiv:2407.19200, 2024 - arxiv.org
Recent advancements in NLP systems, particularly with the introduction of LLMs, have led to
widespread adoption of these systems by a broad spectrum of users across various …

Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base

Z An, X Ding, YC Fu, CC Chu, Y Li, W Du - arxiv preprint arxiv:2408.00798, 2024 - arxiv.org
This paper introduces Golden-Retriever, designed to efficiently navigate vast industrial
knowledge bases, overcoming challenges in traditional LLM fine-tuning and RAG …

Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset

J Goldzycher, P Röttger, G Schneider - arxiv preprint arxiv:2403.19559, 2024 - arxiv.org
Hate speech detection models are only as good as the data they are trained on. Datasets
sourced from social media suffer from systematic gaps and biases, leading to unreliable …

What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

S Feng, H Wan, N Wang, Z Tan, M Luo… - arxiv preprint arxiv …, 2024 - arxiv.org
Social media bot detection has always been an arms race between advancements in
machine learning bot detectors and adversarial bot strategies to evade detection. In this …

Decoding Hate: Exploring Language Models' Reactions to Hate Speech

P Piot, J Parapar - arxiv preprint arxiv:2410.00775, 2024 - arxiv.org
Hate speech is a harmful form of online expression, often manifesting as derogatory posts. It
is a significant risk in digital environments. With the rise of Large Language Models (LLMs) …

Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification

J Cegin, B Pecher, J Simko, I Srba, M Bielikova… - arxiv preprint arxiv …, 2024 - arxiv.org
The generative large language models (LLMs) are increasingly used for data augmentation
tasks, where text samples are paraphrased (or generated anew) and then used for classifier …

Re-examining Sexism and Misogyny Classification with Annotator Attitudes

A Jiang, N Vitsakis, T Dinkar, G Abercrombie… - arxiv preprint arxiv …, 2024 - arxiv.org
Gender-Based Violence (GBV) is an increasing problem online, but existing datasets fail to
capture the plurality of possible annotator perspectives or ensure the representation of …