Home / Papers / Top Research Papers on LLMs

Top Research Papers on LLMs

Delve into the most influential research papers on LLMs and uncover key insights into language learning models. Our handpicked selection provides a comprehensive overview, making it easy for researchers and enthusiasts to stay updated with the latest advancements in the field. Whether you're a beginner or an expert, these papers will provide valuable knowledge and inspire new ideas.

Looking for research-backed answers?Try AI Search

Recommender Systems in the Era of Large Language Models (LLMs)

234 Citations 2024

Zihuai Zhao, Wenqi Fan, Jiatong Li + 8 more

IEEE Transactions on Knowledge and Data Engineering

This survey comprehensively review LLM-empowered recommender systems from various perspectives including pre-training, fine-tuning, and prompting paradigms, and comprehensively discusses the promising future directions in this emerging field.

Exploring the Potential of Large Language Models (LLMs)in Learning on Graphs

126 Citations 2024

Zhikai Chen, Haitao Mao, Hang Li + 8 more

ACM SIGKDD Explorations Newsletter

This paper aims to explore the potential of LLMs in graph machine learning, especially the node classification task, and investigates two possible pipelines: LLMs-as-Enhancers and LLMs-as-Predictors.

Large language models (LLMs): survey, technical frameworks, and future challenges

167 Citations 2024

Pranjal Kumar

Artificial Intelligence Review

This work provides a comprehensive overview of LLMs in the context of language modeling, word embeddings, and deep learning, and examines the application of LLMs in diverse fields including text generation, vision-language models, personalized learning, biomedicine, and code generation.

The ethics of ChatGPT in medicine and healthcare: a systematic review on Large Language Models (LLMs)

202 Citations 2024

Joschka Haltaufderheide, Robert Ranisch

npj Digital Medicine

The ethical guidance debate should be reframed to focus on defining what constitutes acceptable human oversight across the spectrum of applications, which involves considering the diversity of settings, varying potentials for harm, and different acceptable thresholds for performance and certainty in healthcare.

A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models

400 Citations 2024

Wenqi Fan, Yujuan Ding, Liangbo Ning + 5 more

journal unavailable

This survey comprehensively review existing research studies in RA-LLMs, covering three primary technical perspectives: architectures, training strategies, and applications, and systematically review mainstream relevant work by their architectures, training strategies, and application areas.

Exploring the use of large language models (LLMs) in chemical engineering education: Building core course problem models with Chat-GPT

162 Citations 2023

Meng‐Lin Tsai, Chong Wei Ong, Cheng‐Liang Chen

Education for Chemical Engineers

This study highlights the potential benefits of integrating Large Language Models (LLMs) into chemical engineering education. In this study, Chat-GPT, a user-friendly LLM, is used as a problem-solving tool. Chemical engineering education has traditionally focused on fundamental knowledge in the classroom with limited opportunities for hands-on problem-solving. To address this issue, our study proposes an LLMs-assisted problem-solving procedure. This approach promotes critical thinking, enhances problem-solving abilities, and facilitates a deeper understanding of core subjects. Furthermore, inc...

14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon

183 Citations 2023

Kevin Maik Jablonka, Qianxiang Ai, Alexander Al‐Feghali + 50 more

Digital Discovery

A hackathon that employed large-language models for various applications, including predicting properties of molecules and materials, designing novel interfaces for tools, extracting knowledge from unstructured data, and developing new educational applications indicates that LLMs will profoundly impact the future of the authors' fields.

Dissociating language and thought in large language models

242 Citations 2024

Kyle Mahowald, Anna A. Ivanova, Idan Blank + 3 more

Trends in Cognitive Sciences

Large language models (LLMs) have come closest among all models to date to mastering human language, yet opinions about their linguistic and cognitive capabilities remain split. Here, we evaluate LLMs using a distinction between formal linguistic competence (knowledge of linguistic rules and patterns) and functional linguistic competence (understanding and using language in the world). We ground this distinction in human neuroscience, which has shown that formal and functional competence rely on different neural mechanisms. Although LLMs are surprisingly good at formal competence, their perfor...

A Watermark for Large Language Models

113 Citations 2023

John Kirchenbauer, Jonas Geiping, Yuxin Wen + 3 more

arXiv (Cornell University)

A statistical test for detecting the watermark with interpretable p-values is proposed, and an information-theoretic framework for analyzing the sensitivity of the watermarks is derived.

Large language models in medicine

2832 Citations 2023

Arun James Thirunavukarasu, Darren Shu Jeng Ting, Kabilan Elangovan + 3 more

Nature Medicine

This review explains how large language models (LLMs), such as ChatGPT, are developed and discusses their strengths and limitations in the context of potential clinical applications, as a primer for interested clinicians.

Large Language Models: A Survey

199 Citations 2024

Shervin Minaee, Tomas Mikolov, Narjes Nikzad-Khasmakhi + 4 more

arXiv (Cornell University)

This paper reviews some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discusses their characteristics, contributions and limitations, and gives an overview of techniques developed to build, and augment LLMs.

A Survey of Large Language Models

1355 Citations 2023

Wayne Xin Zhao, Kun Zhou, Junyi Li + 19 more

arXiv (Cornell University)

Language is essentially a complex, intricate system of human expressions governed by grammatical rules. It poses a significant challenge to develop capable AI algorithms for comprehending and grasping a language. As a major approach, language modeling has been widely studied for language understanding and generation in the past two decades, evolving from statistical language models to neural language models. Recently, pre-trained language models (PLMs) have been proposed by pre-training Transformer models over large-scale corpora, showing strong capabilities in solving various NLP tasks. Since...

ChatGPT effects on cognitive skills of undergraduate students: Receiving instant responses from AI-based conversational large language models (LLMs)

204 Citations 2023

Harry Barton Essel, Dimitrios Vlachopoulos, Akosua Aya Essuman + 1 more

Computers and Education Artificial Intelligence

The study found that incorporating ChatGPT influenced the students’ critical, reflective, and creative thinking skills and their dimensions discernibly and provides suggestions for academics, instructional designers, and researchers working in educational technology.

A Survey on Model Compression for Large Language Models

125 Citations 2024

Xunyu Zhu, Jian Li, Yong Liu + 2 more

Transactions of the Association for Computational Linguistics

This paper presents a survey of model compression techniques for LLMs, covering methods like quantization, pruning, and knowledge distillation, highlighting recent advancements and offering valuable insights for researchers and practitioners.

A survey on large language models for recommendation

252 Citations 2024

Likang Wu, Zhi Zheng, Zhaopeng Qiu + 9 more

World Wide Web

A taxonomy that categorizes these models into two major paradigms, respectively Discriminative LLM for Recommendation (DLLM4Rec) and Generative LLM for Recommendation (GLLM4Rec), with the latter being systematically sorted out for the first time.

Science in the age of large language models

273 Citations 2023

Abeba Birhane, Atoosa Kasirzadeh, David Leslie + 1 more

Nature Reviews Physics

Four experts in artificial intelligence ethics and policy discuss potential risks and call for careful consideration and responsible usage to ensure that good scientific practices and trust in science are not compromised.

Talking about Large Language Models

239 Citations 2024

Murray Shanahan

Communications of the ACM

Interacting with a contemporary LLM-based conversational agent can create an illusion of being in the presence of a thinking creature, yet such systems are fundamentally not like us.

Role play with large language models

322 Citations 2023

Murray Shanahan, Kyle McDonell, Laria Reynolds

Nature

Two important cases of dialogue-agent behaviour are addressed this way, namely, (apparent) deception and (apparent) self-awareness, namely, (apparent) deception and (apparent) self-awareness.

Large language models and their impact in ophthalmology

117 Citations 2023

Bjorn Kaijun Betzler, Haichao Chen, Ching‐Yu Cheng + 22 more

The Lancet Digital Health

This Viewpoint seeks to stimulate broader discourse on the potential of large language models in ophthalmology and to galvanise both clinicians and researchers into tackling the prevailing challenges and optimising the benefits ofLarge language models while curtailing the associated risks.

Could a Large Language Model be Conscious?

125 Citations 2023

David J. Chalmers

arXiv (Cornell University)

It is concluded that while it is somewhat unlikely that current large language models are conscious, the possibility that successors to large language models may be conscious in the not-too-distant future should be taken seriously.