Top Research Papers on LLMs
Delve into the most influential research papers on LLMs and uncover key insights into language learning models. Our handpicked selection provides a comprehensive overview, making it easy for researchers and enthusiasts to stay updated with the latest advancements in the field. Whether you're a beginner or an expert, these papers will provide valuable knowledge and inspire new ideas.
Looking for research-backed answers?Try AI Search
Recommender Systems in the Era of Large Language Models (LLMs)
234 Citations 2024Zihuai Zhao, Wenqi Fan, Jiatong Li + 8 more
IEEE Transactions on Knowledge and Data Engineering
This survey comprehensively review LLM-empowered recommender systems from various perspectives including pre-training, fine-tuning, and prompting paradigms, and comprehensively discusses the promising future directions in this emerging field.
Exploring the Potential of Large Language Models (LLMs)in Learning on Graphs
126 Citations 2024Zhikai Chen, Haitao Mao, Hang Li + 8 more
ACM SIGKDD Explorations Newsletter
This paper aims to explore the potential of LLMs in graph machine learning, especially the node classification task, and investigates two possible pipelines: LLMs-as-Enhancers and LLMs-as-Predictors.
Large language models (LLMs): survey, technical frameworks, and future challenges
167 Citations 2024Pranjal Kumar
Artificial Intelligence Review
This work provides a comprehensive overview of LLMs in the context of language modeling, word embeddings, and deep learning, and examines the application of LLMs in diverse fields including text generation, vision-language models, personalized learning, biomedicine, and code generation.
The ethics of ChatGPT in medicine and healthcare: a systematic review on Large Language Models (LLMs)
202 Citations 2024Joschka Haltaufderheide, Robert Ranisch
npj Digital Medicine
The ethical guidance debate should be reframed to focus on defining what constitutes acceptable human oversight across the spectrum of applications, which involves considering the diversity of settings, varying potentials for harm, and different acceptable thresholds for performance and certainty in healthcare.
A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models
400 Citations 2024Wenqi Fan, Yujuan Ding, Liangbo Ning + 5 more
journal unavailable
This survey comprehensively review existing research studies in RA-LLMs, covering three primary technical perspectives: architectures, training strategies, and applications, and systematically review mainstream relevant work by their architectures, training strategies, and application areas.
Exploring the use of large language models (LLMs) in chemical engineering education: Building core course problem models with Chat-GPT
162 Citations 2023Meng‐Lin Tsai, Chong Wei Ong, Cheng‐Liang Chen
Education for Chemical Engineers
This study highlights the potential benefits of integrating Large Language Models (LLMs) into chemical engineering education. In this study, Chat-GPT, a user-friendly LLM, is used as a problem-solving tool. Chemical engineering education has traditionally focused on fundamental knowledge in the classroom with limited opportunities for hands-on problem-solving. To address this issue, our study proposes an LLMs-assisted problem-solving procedure. This approach promotes critical thinking, enhances problem-solving abilities, and facilitates a deeper understanding of core subjects. Furthermore, inc...
14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon
183 Citations 2023Kevin Maik Jablonka, Qianxiang Ai, Alexander Al‐Feghali + 50 more
Digital Discovery
A hackathon that employed large-language models for various applications, including predicting properties of molecules and materials, designing novel interfaces for tools, extracting knowledge from unstructured data, and developing new educational applications indicates that LLMs will profoundly impact the future of the authors' fields.
Dissociating language and thought in large language models
242 Citations 2024Kyle Mahowald, Anna A. Ivanova, Idan Blank + 3 more
Trends in Cognitive Sciences
Large language models (LLMs) have come closest among all models to date to mastering human language, yet opinions about their linguistic and cognitive capabilities remain split. Here, we evaluate LLMs using a distinction between formal linguistic competence (knowledge of linguistic rules and patterns) and functional linguistic competence (understanding and using language in the world). We ground this distinction in human neuroscience, which has shown that formal and functional competence rely on different neural mechanisms. Although LLMs are surprisingly good at formal competence, their perfor...
A Watermark for Large Language Models
113 Citations 2023John Kirchenbauer, Jonas Geiping, Yuxin Wen + 3 more
arXiv (Cornell University)
A statistical test for detecting the watermark with interpretable p-values is proposed, and an information-theoretic framework for analyzing the sensitivity of the watermarks is derived.
Large language models in medicine
2832 Citations 2023Arun James Thirunavukarasu, Darren Shu Jeng Ting, Kabilan Elangovan + 3 more
Nature Medicine
This review explains how large language models (LLMs), such as ChatGPT, are developed and discusses their strengths and limitations in the context of potential clinical applications, as a primer for interested clinicians.
Large Language Models: A Survey
199 Citations 2024Shervin Minaee, Tomas Mikolov, Narjes Nikzad-Khasmakhi + 4 more
arXiv (Cornell University)
This paper reviews some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discusses their characteristics, contributions and limitations, and gives an overview of techniques developed to build, and augment LLMs.
A Survey of Large Language Models
1355 Citations 2023Wayne Xin Zhao, Kun Zhou, Junyi Li + 19 more
arXiv (Cornell University)
Language is essentially a complex, intricate system of human expressions governed by grammatical rules. It poses a significant challenge to develop capable AI algorithms for comprehending and grasping a language. As a major approach, language modeling has been widely studied for language understanding and generation in the past two decades, evolving from statistical language models to neural language models. Recently, pre-trained language models (PLMs) have been proposed by pre-training Transformer models over large-scale corpora, showing strong capabilities in solving various NLP tasks. Since...
ChatGPT effects on cognitive skills of undergraduate students: Receiving instant responses from AI-based conversational large language models (LLMs)
204 Citations 2023Harry Barton Essel, Dimitrios Vlachopoulos, Akosua Aya Essuman + 1 more
Computers and Education Artificial Intelligence
The study found that incorporating ChatGPT influenced the students’ critical, reflective, and creative thinking skills and their dimensions discernibly and provides suggestions for academics, instructional designers, and researchers working in educational technology.
A Survey on Model Compression for Large Language Models
125 Citations 2024Xunyu Zhu, Jian Li, Yong Liu + 2 more
Transactions of the Association for Computational Linguistics
This paper presents a survey of model compression techniques for LLMs, covering methods like quantization, pruning, and knowledge distillation, highlighting recent advancements and offering valuable insights for researchers and practitioners.
A survey on large language models for recommendation
252 Citations 2024Likang Wu, Zhi Zheng, Zhaopeng Qiu + 9 more
World Wide Web
A taxonomy that categorizes these models into two major paradigms, respectively Discriminative LLM for Recommendation (DLLM4Rec) and Generative LLM for Recommendation (GLLM4Rec), with the latter being systematically sorted out for the first time.
Science in the age of large language models
273 Citations 2023Abeba Birhane, Atoosa Kasirzadeh, David Leslie + 1 more
Nature Reviews Physics
Four experts in artificial intelligence ethics and policy discuss potential risks and call for careful consideration and responsible usage to ensure that good scientific practices and trust in science are not compromised.
Interacting with a contemporary LLM-based conversational agent can create an illusion of being in the presence of a thinking creature, yet such systems are fundamentally not like us.
Role play with large language models
322 Citations 2023Murray Shanahan, Kyle McDonell, Laria Reynolds
Nature
Two important cases of dialogue-agent behaviour are addressed this way, namely, (apparent) deception and (apparent) self-awareness, namely, (apparent) deception and (apparent) self-awareness.
Large language models and their impact in ophthalmology
117 Citations 2023Bjorn Kaijun Betzler, Haichao Chen, Ching‐Yu Cheng + 22 more
The Lancet Digital Health
This Viewpoint seeks to stimulate broader discourse on the potential of large language models in ophthalmology and to galvanise both clinicians and researchers into tackling the prevailing challenges and optimising the benefits ofLarge language models while curtailing the associated risks.
Multimodal Large Language Models: A Survey
170 Citations 2023Jiayang Wu, Wensheng Gan, Zefeng Chen + 2 more
journal unavailable
A range of multimodal products are introduced, focusing on the efforts of major technology companies, and a compilation of the latest algorithms and commonly used datasets are presented, providing researchers with valuable resources for experimentation and evaluation.
Emergent Abilities of Large Language Models
1008 Citations 2022Jason Lee, Yi Tay, Rishi Bommasani + 13 more
arXiv (Cornell University)
This paper discusses an unpredictable phenomenon that is referred to as emergent abilities of large language models, an ability to be emergent if it is not present in smaller models but is present in larger models.
A survey on multimodal large language models
414 Citations 2024Shukang Yin, Chaoyou Fu, Sirui Zhao + 4 more
National Science Review
This paper presents the basic formulation of the MLLM and delineates its related concepts, including architecture, training strategy and data, as well as evaluation, and introduces research topics about how MLLMs can be extended to support more granularity, modalities, languages and scenarios.
Could a Large Language Model be Conscious?
125 Citations 2023David J. Chalmers
arXiv (Cornell University)
It is concluded that while it is somewhat unlikely that current large language models are conscious, the possibility that successors to large language models may be conscious in the not-too-distant future should be taken seriously.
Using large language models in psychology
277 Citations 2023Dorottya Demszky, Diyi Yang, David S. Yeager + 15 more
Nature Reviews Psychology
Large language models (LLMs), such as OpenAI's GPT-4, Google's Bard or Meta's LLaMa, have created unprecedented opportunities for analysing and generating language data on a massive scale. Because language data have a central role in all areas of psychology, this new technology has the potential to transform the field. In this Perspective, we review the foundations of LLMs. We then explain how the way that LLMs are constructed enables them to effectively generate human-like linguistic output without the ability to think or feel like a human. We argue that although LLMs have the potential to ad...
Large Language Models in Finance: A Survey
189 Citations 2023Yinheng Li, Shaofei Wang, Han Ding + 1 more
journal unavailable
A decision framework is proposed to guide financial professionals in selecting the appropriate LLM solution based on their use case constraints around data, compute, and performance needs and provides a pathway from lightweight experimentation to heavy investment in customized LLMs.
Galactica: A Large Language Model for Science
256 Citations 2022Ross Taylor, Marcin Kardas, Guillem Cucurull + 6 more
arXiv (Cornell University)
Information overload is a major obstacle to scientific progress. The explosive growth in scientific literature and data has made it ever harder to discover useful insights in a large mass of information. Today scientific knowledge is accessed through search engines, but they are unable to organize scientific knowledge alone. In this paper we introduce Galactica: a large language model that can store, combine and reason about scientific knowledge. We train on a large scientific corpus of papers, reference material, knowledge bases and many other sources. We outperform existing models on a range...
Explainability for Large Language Models: A Survey
428 Citations 2024Haiyan Zhao, Hanjie Chen, Fan Yang + 6 more
ACM Transactions on Intelligent Systems and Technology
A taxonomy of explainability techniques is introduced and a structured overview of methods for explaining Transformer-based language models is provided, based on the training paradigms of LLMs: traditional fine-tuning-based paradigm and prompting-based paradigm.
A Comprehensive Overview of Large Language Models
342 Citations 2023Humza Naveed, Asad Ullah Khan, Shi Qiu + 6 more
arXiv (Cornell University)
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works encompass diverse topics such as architectural innovations, better training strategies, context length improvements, fine-tuning, multi-modal LLMs, robotics, datasets, benchmarking, efficiency, and more. With the rapid development of techniques and regular breakthroughs in LLM research, it has become considerably challenging to perceive the bigger picture of the ad...
Challenges and Applications of Large Language Models
168 Citations 2023Jean Kaddour, Joshua Harris, Maximilian Mozes + 3 more
arXiv (Cornell University)
This paper aims to establish a systematic set of open problems and application successes so that ML researchers can comprehend the field's current state more quickly and become productive.
BloombergGPT: A Large Language Model for Finance
296 Citations 2023Shijie Wu, Ozan İrsoy, Steven Lu + 6 more
arXiv (Cornell University)
The use of NLP in the realm of financial technology is broad and complex, with applications ranging from sentiment analysis and named entity recognition to question answering. Large Language Models (LLMs) have been shown to be effective on a variety of tasks; however, no LLM specialized for the financial domain has been reported in literature. In this work, we present BloombergGPT, a 50 billion parameter language model that is trained on a wide range of financial data. We construct a 363 billion token dataset based on Bloomberg's extensive data sources, perhaps the largest domain-specific data...
A Survey on Evaluation of Large Language Models
2088 Citations 2024Yupeng Chang, Xu Wang, Jindong Wang + 13 more
ACM Transactions on Intelligent Systems and Technology
This paper presents a comprehensive review of these evaluation methods for LLMs, focusing on three key dimensions: what to evaluate, where to evaluate, and how to evaluate, and offers invaluable insights to researchers in the realm of LLMs evaluation.
A Survey on Evaluation of Large Language Models
193 Citations 2023Yupeng Chang, Xu Wang, Jindong Wang + 13 more
arXiv (Cornell University)
Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications. As LLMs continue to play a vital role in both research and daily use, their evaluation becomes increasingly critical, not only at the task level, but also at the society level for better understanding of their potential risks. Over the past years, significant efforts have been made to examine LLMs from various perspectives. This paper presents a comprehensive review of these evaluation methods for LLMs, focusing on three key dimensions:...
A Comprehensive Overview of Large Language Models
290 Citations 2025Humza Naveed, Asad Ullah Khan, Shi Qiu + 6 more
ACM Transactions on Intelligent Systems and Technology
This review article is intended to provide a quick, comprehensive reference for the researchers and practitioners to draw insights from extensive, informative summaries of the existing works to advance the LLM research.
Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models
226 Citations 2023Yinlin Deng, Chunqiu Steven Xia, Haoran Peng + 2 more
journal unavailable
TitanFuzz is demonstrated that modern titanic LLMs can be leveraged to directly perform both generation-based and mutation-based fuzzing studied for decades, while being fully automated, generalizable, and applicable to domains challenging for traditional approaches.
Large Language Models Demonstrate the Potential of Statistical Learning in Language
122 Citations 2023Pablo Contreras Kallens, Ross Deans Kristensen‐McLachlan, Morten H. Christiansen
Cognitive Science
It is suggested that the most recent generation of Large Language Models (LLMs) might finally provide the computational tools to determine empirically how much of the human language ability can be acquired from linguistic experience.
Gender bias and stereotypes in Large Language Models
280 Citations 2023Hadas Kotek, Rikker Dockum, David Sun
journal unavailable
This paper investigates LLMs’ behavior with respect to gender stereotypes, a known issue for prior models, and suggests that LLMs must be carefully tested to ensure that they treat minoritized individuals and communities equitably.
Wordcraft: Story Writing With Large Language Models
309 Citations 2022Ann Yuan, Andy Coenen, Emily Reif + 1 more
journal unavailable
This work built Wordcraft, a text editor in which users collaborate with a generative language model to write a story, and shows that large language models enable novel co-writing experiences.
Creation and Adoption of Large Language Models in Medicine
361 Citations 2023Nigam H. Shah, David A. Entwistle, Michael A. Pfeffer
JAMA
The creation and use of LLMs in medicine need to be actively shaped by provisioning relevant training data, specifying the desired benefits, and evaluating the benefits via testing in real-world deployments.
Large Language Models in Medicine: The Potentials and Pitfalls
263 Citations 2024Jesutofunmi A. Omiye, Haiwen Gui, Shawheen J. Rezaei + 2 more
Annals of Internal Medicine
A comprehensive yet accessible overview of the rapidly changing landscape of LLMs in medicine is provided, and active research areas in the field that promise to improve LLMs' usability in health care contexts are highlighted.
A large language model for electronic health records
755 Citations 2022Xi Yang, Aokun Chen, Nima PourNejatian + 16 more
npj Digital Medicine
A large clinical language model—GatorTron—is developed from scratch and systematically evaluate it on five clinical NLP tasks including clinical concept extraction, medical relation extraction, semantic textual similarity, natural language inference (NLI), and medical question answering (MQA).