Top Research Papers on LLMs
Delve into the most influential research papers on LLMs and uncover key insights into language learning models. Our handpicked selection provides a comprehensive overview, making it easy for researchers and enthusiasts to stay updated with the latest advancements in the field. Whether you're a beginner or an expert, these papers will provide valuable knowledge and inspire new ideas.
Looking for research-backed answers?Try AI Search
Recommender Systems in the Era of Large Language Models (LLMs)
234 Citations 2024Zihuai Zhao, Wenqi Fan, Jiatong Li + 8 more
IEEE Transactions on Knowledge and Data Engineering
This survey comprehensively review LLM-empowered recommender systems from various perspectives including pre-training, fine-tuning, and prompting paradigms, and comprehensively discusses the promising future directions in this emerging field.
Exploring the Potential of Large Language Models (LLMs)in Learning on Graphs
126 Citations 2024Zhikai Chen, Haitao Mao, Hang Li + 8 more
ACM SIGKDD Explorations Newsletter
This paper aims to explore the potential of LLMs in graph machine learning, especially the node classification task, and investigates two possible pipelines: LLMs-as-Enhancers and LLMs-as-Predictors.
Large language models (LLMs): survey, technical frameworks, and future challenges
167 Citations 2024Pranjal Kumar
Artificial Intelligence Review
This work provides a comprehensive overview of LLMs in the context of language modeling, word embeddings, and deep learning, and examines the application of LLMs in diverse fields including text generation, vision-language models, personalized learning, biomedicine, and code generation.
The ethics of ChatGPT in medicine and healthcare: a systematic review on Large Language Models (LLMs)
202 Citations 2024Joschka Haltaufderheide, Robert Ranisch
npj Digital Medicine
The ethical guidance debate should be reframed to focus on defining what constitutes acceptable human oversight across the spectrum of applications, which involves considering the diversity of settings, varying potentials for harm, and different acceptable thresholds for performance and certainty in healthcare.
A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models
400 Citations 2024Wenqi Fan, Yujuan Ding, Liangbo Ning + 5 more
journal unavailable
This survey comprehensively review existing research studies in RA-LLMs, covering three primary technical perspectives: architectures, training strategies, and applications, and systematically review mainstream relevant work by their architectures, training strategies, and application areas.
Exploring the use of large language models (LLMs) in chemical engineering education: Building core course problem models with Chat-GPT
162 Citations 2023Meng‐Lin Tsai, Chong Wei Ong, Cheng‐Liang Chen
Education for Chemical Engineers
This study highlights the potential benefits of integrating Large Language Models (LLMs) into chemical engineering education. In this study, Chat-GPT, a user-friendly LLM, is used as a problem-solving tool. Chemical engineering education has traditionally focused on fundamental knowledge in the classroom with limited opportunities for hands-on problem-solving. To address this issue, our study proposes an LLMs-assisted problem-solving procedure. This approach promotes critical thinking, enhances problem-solving abilities, and facilitates a deeper understanding of core subjects. Furthermore, inc...
14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon
183 Citations 2023Kevin Maik Jablonka, Qianxiang Ai, Alexander Al‐Feghali + 50 more
Digital Discovery
A hackathon that employed large-language models for various applications, including predicting properties of molecules and materials, designing novel interfaces for tools, extracting knowledge from unstructured data, and developing new educational applications indicates that LLMs will profoundly impact the future of the authors' fields.
Dissociating language and thought in large language models
242 Citations 2024Kyle Mahowald, Anna A. Ivanova, Idan Blank + 3 more
Trends in Cognitive Sciences
Large language models (LLMs) have come closest among all models to date to mastering human language, yet opinions about their linguistic and cognitive capabilities remain split. Here, we evaluate LLMs using a distinction between formal linguistic competence (knowledge of linguistic rules and patterns) and functional linguistic competence (understanding and using language in the world). We ground this distinction in human neuroscience, which has shown that formal and functional competence rely on different neural mechanisms. Although LLMs are surprisingly good at formal competence, their perfor...
A Watermark for Large Language Models
113 Citations 2023John Kirchenbauer, Jonas Geiping, Yuxin Wen + 3 more
arXiv (Cornell University)
A statistical test for detecting the watermark with interpretable p-values is proposed, and an information-theoretic framework for analyzing the sensitivity of the watermarks is derived.
Large language models in medicine
2832 Citations 2023Arun James Thirunavukarasu, Darren Shu Jeng Ting, Kabilan Elangovan + 3 more
Nature Medicine
This review explains how large language models (LLMs), such as ChatGPT, are developed and discusses their strengths and limitations in the context of potential clinical applications, as a primer for interested clinicians.
Large Language Models: A Survey
199 Citations 2024Shervin Minaee, Tomas Mikolov, Narjes Nikzad-Khasmakhi + 4 more
arXiv (Cornell University)
This paper reviews some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discusses their characteristics, contributions and limitations, and gives an overview of techniques developed to build, and augment LLMs.
A Survey of Large Language Models
1355 Citations 2023Wayne Xin Zhao, Kun Zhou, Junyi Li + 19 more
arXiv (Cornell University)
Language is essentially a complex, intricate system of human expressions governed by grammatical rules. It poses a significant challenge to develop capable AI algorithms for comprehending and grasping a language. As a major approach, language modeling has been widely studied for language understanding and generation in the past two decades, evolving from statistical language models to neural language models. Recently, pre-trained language models (PLMs) have been proposed by pre-training Transformer models over large-scale corpora, showing strong capabilities in solving various NLP tasks. Since...
ChatGPT effects on cognitive skills of undergraduate students: Receiving instant responses from AI-based conversational large language models (LLMs)
204 Citations 2023Harry Barton Essel, Dimitrios Vlachopoulos, Akosua Aya Essuman + 1 more
Computers and Education Artificial Intelligence
The study found that incorporating ChatGPT influenced the students’ critical, reflective, and creative thinking skills and their dimensions discernibly and provides suggestions for academics, instructional designers, and researchers working in educational technology.
A Survey on Model Compression for Large Language Models
125 Citations 2024Xunyu Zhu, Jian Li, Yong Liu + 2 more
Transactions of the Association for Computational Linguistics
This paper presents a survey of model compression techniques for LLMs, covering methods like quantization, pruning, and knowledge distillation, highlighting recent advancements and offering valuable insights for researchers and practitioners.
A survey on large language models for recommendation
252 Citations 2024Likang Wu, Zhi Zheng, Zhaopeng Qiu + 9 more
World Wide Web
A taxonomy that categorizes these models into two major paradigms, respectively Discriminative LLM for Recommendation (DLLM4Rec) and Generative LLM for Recommendation (GLLM4Rec), with the latter being systematically sorted out for the first time.
Science in the age of large language models
273 Citations 2023Abeba Birhane, Atoosa Kasirzadeh, David Leslie + 1 more
Nature Reviews Physics
Four experts in artificial intelligence ethics and policy discuss potential risks and call for careful consideration and responsible usage to ensure that good scientific practices and trust in science are not compromised.
Interacting with a contemporary LLM-based conversational agent can create an illusion of being in the presence of a thinking creature, yet such systems are fundamentally not like us.
Role play with large language models
322 Citations 2023Murray Shanahan, Kyle McDonell, Laria Reynolds
Nature
Two important cases of dialogue-agent behaviour are addressed this way, namely, (apparent) deception and (apparent) self-awareness, namely, (apparent) deception and (apparent) self-awareness.
Large language models and their impact in ophthalmology
117 Citations 2023Bjorn Kaijun Betzler, Haichao Chen, Ching‐Yu Cheng + 22 more
The Lancet Digital Health
This Viewpoint seeks to stimulate broader discourse on the potential of large language models in ophthalmology and to galvanise both clinicians and researchers into tackling the prevailing challenges and optimising the benefits ofLarge language models while curtailing the associated risks.
Could a Large Language Model be Conscious?
125 Citations 2023David J. Chalmers
arXiv (Cornell University)
It is concluded that while it is somewhat unlikely that current large language models are conscious, the possibility that successors to large language models may be conscious in the not-too-distant future should be taken seriously.