E-waste challenges of generative artificial intelligence
Generative artificial intelligence (GAI) requires substantial computational resources for
model training and inference, but the electronic-waste (e-waste) implications of GAI and its …
ChatGPT in the age of generative AI and large language models: a concise survey
ChatGPT is a large language model (LLM) created by OpenAI that has been carefully
trained on a large amount of data. It has revolutionized the field of natural language …
Efficient training of large language models on distributed infrastructures: a survey
Large Language Models (LLMs) like GPT and LLaMA are revolutionizing the AI industry with
their sophisticated capabilities. Training these models requires vast GPU clusters and …
A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators
Artificial intelligence (AI) methods have become critical in scientific applications to help
accelerate scientific discovery. Large language models (LLMs) are being considered as a …
Comparative Study of Large Language Model Architectures on Frontier
Large language models (LLMs) have garnered significant attention in both the AI community
and beyond. Among these, the Generative Pre-trained Transformer (GPT) has emerged as …
chatHPC: Empowering HPC users with large language models
The ever-growing number of pre-trained large language models (LLMs) across scientific
domains presents a challenge for application developers. While these models offer vast …
TAPI: Towards target-specific and adversarial prompt injection against code LLMs
Recently, code-oriented large language models (Code LLMs) have been widely and
successfully used to simplify and facilitate code programming. With these tools, developers …
AI-coupled HPC Workflow Applications, Middleware and Performance
AI integration is revolutionizing the landscape of HPC simulations, enhancing the
importance, use, and performance of AI-driven HPC workflows. This paper surveys the …
Toward a holistic performance evaluation of large language models across diverse AI accelerators
Artificial intelligence (AI) methods have become critical in scientific applications to help
accelerate scientific discovery. Large language models (LLMs) are being considered a …
Optimizing distributed training on Frontier for large language models
Large language models (LLMs) have demonstrated remarkable success as foundational
models, benefiting various downstream applications through fine-tuning. Loss scaling …