Delve into the most influential research papers on LLMs and uncover key insights into large language models. Our handpicked selection provides a comprehensive overview, making it easy for researchers and enthusiasts to stay updated with the latest advancements in the field. Whether you're a beginner or an expert, these papers will provide valuable knowledge and inspire new ideas.
Xinyin Ma, Gongfan Fang, Xinchao Wang
ArXiv
This work explores LLM compression in a task-agnostic manner, aiming to preserve the multi-task solving and language generation abilities of the original LLM; it adopts structural pruning that selectively removes non-critical coupled structures based on gradient information, preserving the majority of the LLM's functionality.
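As a rough illustration of the gradient-informed structural pruning described above, the minimal sketch below (an assumption-laden toy example, not the authors' LLM-Pruner code) scores each output channel of a linear layer by a first-order Taylor term |weight × gradient| and keeps only the highest-scoring channels:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
layer = nn.Linear(16, 8)
x = torch.randn(4, 16)
loss = layer(x).pow(2).mean()   # stand-in task loss
loss.backward()                 # populates layer.weight.grad

# First-order Taylor importance per output channel: sum over incoming weights.
importance = (layer.weight * layer.weight.grad).abs().sum(dim=1)   # shape [8]

keep = importance.argsort(descending=True)[:6]   # keep the 6 most important rows
pruned = nn.Linear(16, 6)
with torch.no_grad():
    pruned.weight.copy_(layer.weight[keep])
    pruned.bias.copy_(layer.bias[keep])
print("kept channels:", keep.tolist())
```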
Juan Manuel Zambrano Chaves, Eric Wang, Tao Tu + 7 more
ArXiv
This work introduces Tx-LLM, a generalist large language model (LLM) fine-tuned from PaLM-2 that encodes knowledge about diverse therapeutic modalities; the authors believe it represents an important step towards LLMs encoding biochemical knowledge and could eventually serve as an end-to-end tool across the drug discovery and development pipeline.
Wenqi Fan, Zihuai Zhao, Jiatong Li + 5 more
IEEE Transactions on Knowledge and Data Engineering
This survey comprehensively reviews LLM-empowered recommender systems from various perspectives, including pre-training, fine-tuning, and prompting paradigms, and discusses promising future directions in this emerging field.
Yuzhang Shang, Zhihang Yuan, Qiang Wu + 1 more
ArXiv
This paper explores network binarization, a radical form of quantization that compresses model weights to a single bit, specifically for Large Language Model (LLM) compression, and proposes a novel approach, Partially-Binarized LLM (PB-LLM), which can achieve extreme low-bit quantization while maintaining the linguistic reasoning capacity of quantized LLMs.
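The partial-binarization idea can be pictured with a small sketch; this is a hedged toy example under our own assumptions (per-tensor scaling, magnitude-based saliency), not the PB-LLM implementation:

```python
import torch

def partially_binarize(w: torch.Tensor, salient_frac: float = 0.1) -> torch.Tensor:
    """Keep the top `salient_frac` of weights (by magnitude) in full precision;
    binarize the rest to sign(w) * mean(|w|) of the non-salient portion."""
    k = max(1, int(salient_frac * w.numel()))
    threshold = w.abs().flatten().topk(k).values.min()
    salient = w.abs() >= threshold
    alpha = w[~salient].abs().mean()            # per-tensor scale for the binary part
    return torch.where(salient, w, torch.sign(w) * alpha)

w = torch.randn(256, 256)
w_pb = partially_binarize(w)
print("mean reconstruction error:", (w - w_pb).abs().mean().item())
```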
Vera Sorin, Danna Brin, Yiftach Barash + 4 more
journal unavailable
Purpose: Empathy, a cornerstone of human interaction, is a uniquely human quality that Large Language Models (LLMs) are believed to lack. Our study aims to review the literature on the capacity of LLMs to demonstrate empathy. Methods: We conducted a literature search on MEDLINE up to July 2023. Seven publications ultimately met the inclusion criteria. Results: All studies included in this review were published in 2023. All studies but one focused on ChatGPT-3.5 by OpenAI. Only one study evaluated empathy based on objective metrics, and all others used subjective human assessment. The studie...
Duzhen Zhang, Yahan Yu, Chenxing Li + 4 more
journal unavailable
A taxonomy encompassing 126 MM-LLMs, each characterized by its specific formulations, is introduced, and the performance of selected MM-LLMs on mainstream benchmarks is reviewed, along with key training recipes for enhancing the potency of MM-LLMs.
S. Routray, A. Javali, K. Sharmila + 3 more
2023 International Conference on Computer Science and Emerging Technologies (CSET)
The basic principles and features of large language models, a type of AI model that is trained on vast amounts of text data to understand and generate human-like language outputs, are studied.
Zhikai Chen, Haitao Mao, Hang Li + 8 more
ACM SIGKDD Explorations Newsletter
This paper aims to explore the potential of LLMs in graph machine learning, especially the node classification task, and investigates two possible pipelines: LLMs-as-Enhancers and LLMs-as-Predictors.
Yining Hong, Haoyu Zhen, Peihao Chen + 4 more
ArXiv
This work proposes to inject the 3D world into large language models and introduce a whole new family of 3D-LLMs that can take 3D point clouds and their features as input and perform a diverse set of 3D-related tasks, including captioning, dense captioning, 3D question answering, task decomposition, 3D grounding, 3D-assisted dialog, navigation, and so on.
Daniel P. Jeong, Zachary Chase Lipton, Pradeep Ravikumar
ArXiv
It is found that the latest models, such as GPT-4, can consistently identify the most predictive features regardless of the query mechanism and across various prompting strategies, which suggests that LLM-based feature selection consistently achieves strong performance competitive with data-driven methods such as the LASSO.
Ruyang Liu, Chen Li, Haoran Tang + 3 more
ArXiv
This paper proposes ST-LLM, an effective video-LLM baseline with Spatial-Temporal sequence modeling inside LLM, and develops a dynamic masking strategy with tailor-made training objectives to address the overhead and stability issues introduced by uncompressed video tokens within LLMs.
Biwei Yan, Kun Li, Minghui Xu + 4 more
ArXiv
This paper conducts an assessment of the privacy protection mechanisms employed by LLMs at various stages, followed by a detailed examination of their efficacy and constraints, and delineates the spectrum of data privacy threats.
Yadong Zhang, Shaoguang Mao, Tao Ge + 7 more
ArXiv
This paper explores the scopes, applications, methodologies, and evaluation metrics related to strategic reasoning with LLMs, highlighting the burgeoning development in this area and the interdisciplinary approaches enhancing their decision-making performance.
Tosin P. Adewumi, Nudrat Habib, Lama Alkhaled + 1 more
ArXiv
This work empirically evaluates the power of 3 open SotA LLMs in a zero-shot setting (LLaMA-2-13B, Mixtral 8x7B, and Gemma-7B), and introduces a new hallucination metric, the Simple Hallucination Index (SHI).
Bernardo Magnini, Roberto Zanoli, Michele Resta + 4 more
ArXiv
Evalita-LLM, a new benchmark designed to evaluate Large Language Models on Italian tasks, is described, and an iterative methodology, in which candidate tasks and candidate prompts are validated against a set of LLMs used for development, is proposed.
Haiwei Dong, Shuang Xie
ArXiv
This paper explored the deployment strategies, economic considerations, and sustainability challenges associated with the state-of-the-art LLMs, and discussed the deployment debate between Retrieval-Augmented Generation and fine-tuning, highlighting their respective advantages and limitations.
Kaiqi Yang, Hang Li, Hongzhi Wen + 3 more
journal unavailable
It is revealed that LLMs cannot work as expected on social prediction when given general input features without shortcuts, and possible reasons for this phenomenon are investigated, which suggest potential ways to enhance LLMs for social prediction.
Rajesh Pasupuleti, Ravi Vadapalli, Christopher Mader + 1 more
2024 2nd International Conference on Foundation and Large Language Models (FLLM)
The paper provides a comprehensive analysis of the transformative impact of LLMs across various enterprise sectors, surveys the popular LLMs currently used in enterprise applications across domains, and discusses the ethical, technical, and regulatory challenges, future trends, and developments in this dynamic field.
Abiodun Finbarrs Oketunji, Muhammad Anas, Deepthi Saina
ArXiv
The research reveals that LLMs, whilst demonstrating impressive capabilities in text generation, exhibit varying degrees of bias across different dimensions; the proposed quantifiable measure allows biases to be compared across models and over time, giving systems engineers, researchers, and regulators a vital tool for enhancing the fairness and reliability of LLMs.
Jiongnan Liu, Jiajie Jin, Zihan Wang + 3 more
ArXiv
RETA-LLM provides more plug-and-play modules to support better interaction between IR systems and LLMs, including request rewriting, document retrieval, passage extraction, answer generation, and fact checking modules.
G. P. Reddy, Y. V. Pavan Kumar, K. P. Prakash
2024 IEEE Open Conference of Electrical, Electronic and Information Sciences (eStream)
The causes of hallucinations in Large Language Models are understood, the implications are explored, and potential strategies for mitigation are discussed to enhance the reliability of AI-generated content.
Wenqi Fan, Shijie Wang, Jiani Huang + 8 more
ArXiv
This survey reviews the recent developments in Graph ML and explores how LLMs can be utilized to enhance the quality of graph features, alleviate the reliance on labeled data, and address challenges such as graph heterogeneity and out-of-distribution (OOD) generalization.
B. Liu, Yuqian Jiang, Xiaohan Zhang + 4 more
ArXiv
LLM+P is the first framework that incorporates the strengths of classical planners into large language models; it is able to provide optimal solutions for most problems, while LLMs alone fail to provide even feasible plans for most problems.
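The overall flow the summary describes can be sketched in a few lines; llm() and run_planner() below are hypothetical stand-ins (not APIs from the paper), meant only to show the translate-plan-explain loop:

```python
# Hypothetical stand-ins: swap in a real chat model and a classical planner
# (e.g. Fast Downward) to make this do real work.
def llm(prompt: str) -> str:
    return f"<LLM output for: {prompt[:40]}...>"

def run_planner(domain_pddl: str, problem_pddl: str) -> str:
    return "<plan found by a classical planner>"

def plan_with_llm_p(task: str, domain_pddl: str, example_problem: str) -> str:
    # 1. The LLM translates the natural-language task into a PDDL problem file,
    #    conditioned on the fixed domain and an in-context example.
    problem_pddl = llm(f"Domain:\n{domain_pddl}\nExample:\n{example_problem}\n"
                       f"Write a PDDL problem for: {task}")
    # 2. A sound classical planner searches for a valid (often optimal) plan.
    plan = run_planner(domain_pddl, problem_pddl)
    # 3. The LLM turns the plan back into natural language for the user.
    return llm(f"Explain this plan in plain language:\n{plan}")

print(plan_with_llm_p("stack block A on block B",
                      "(define (domain blocksworld) ...)",
                      "(define (problem example) ...)"))
```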
Shih-Chieh Dai, Aiping Xiong, Lun-Wei Ku
journal unavailable
This work proposes a human-LLM collaboration framework (i.e., LLM-in-the-loop) to conduct TA with in-context learning (ICL), which yields similar coding quality to that of human coders but reduces TA's labor and time demands.
Tianyu Du, Ayush Kanodia, Herman Brunborg + 2 more
ArXiv
The value of fine-tuning is demonstrated and it is shown that by adding more career data from a different population, fine-tuning smaller LLMs surpasses the performance of fine-tuning larger models.
Mengting Wan, Tara Safavi, S. Jauhar + 11 more
journal unavailable
TnT-LLM is a two-phase framework that employs LLMs to automate end-to-end label generation and assignment with minimal human effort; it generates more accurate and relevant label taxonomies than state-of-the-art baselines and achieves a favorable balance between accuracy and efficiency for classification at scale.
Yang Liu, Yuanshun Yao, Jean-François Ton + 6 more
ArXiv
A comprehensive survey of key dimensions that are crucial to consider when assessing LLM trustworthiness is presented, which indicates that, in general, more aligned models tend to perform better in terms of overall trustworthiness.
Ming Jin, Shiyu Wang, Lintao Ma + 8 more
ArXiv
Time-LLM is a reprogramming framework to repurpose LLMs for general time series forecasting with the backbone language models kept intact and is demonstrated to be a powerful time series learner that outperforms state-of-the-art, specialized forecasting models.
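A minimal sketch of the reprogramming idea, under the assumption that the frozen backbone accepts 768-dimensional token embeddings (illustrative only, not the Time-LLM code): slice the series into patches and project each patch into the backbone's embedding space with a small trainable adapter.

```python
import torch
import torch.nn as nn

patch_len, stride, d_model = 16, 8, 768    # d_model assumed to match the frozen LLM
series = torch.randn(32, 512)              # batch of univariate series, length 512

patches = series.unfold(dimension=1, size=patch_len, step=stride)  # [32, 63, 16]
to_llm_space = nn.Linear(patch_len, d_model)                       # trainable adapter
tokens = to_llm_space(patches)                                     # [32, 63, 768]

# `tokens` would be fed to the frozen LLM alongside a text prompt; only the
# adapter (and an output head) are trained.
print(tokens.shape)
```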
Avinash Maurya, Robert Underwood, M. Rafique + 2 more
journal unavailable
This paper introduces a lazy asynchronous multi-level approach that takes advantage of the fact that the tensors making up the model and optimizer state shards remain immutable for extended periods of time, which makes it possible to copy their content in the background with minimal interference during the training process.
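A much-simplified sketch of the background-copy idea (our own illustration, not the paper's multi-level system): snapshot immutable shards to host memory on a worker thread so the training loop is not blocked.

```python
import threading
import torch

def snapshot_async(tensors):
    """Start copying `tensors` to CPU memory on a background thread."""
    result = {}
    def _copy():
        result["cpu_copies"] = [t.detach().cpu().clone() for t in tensors]
    worker = threading.Thread(target=_copy, daemon=True)
    worker.start()
    return worker, result

params = [torch.randn(1024, 1024) for _ in range(4)]   # stand-in shards
worker, snap = snapshot_async(params)
# ... the training step continues here while the copy proceeds ...
worker.join()
print(len(snap["cpu_copies"]), "shards checkpointed")
```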
Kai Sun, Y. Xu, Hanwen Zha + 2 more
ArXiv
This paper constructed Head-to-Tail, a benchmark that consists of 18K question-answer pairs regarding head, torso, and tail facts in terms of popularity and designed an automated evaluation method and a set of metrics that closely approximate the knowledge an LLM confidently internalizes.
Qinbin Li, Junyuan Hong, Chulin Xie + 10 more
Proc. VLDB Endow.
LLM-PBE is a toolkit crafted specifically for the systematic evaluation of data privacy risks in LLMs, designed to analyze privacy across the entire lifecycle of LLMs, incorporating diverse attack and defense strategies, and handling various data types and metrics.
Saurabh Pahune, Manoj Chandrasekharan
ArXiv
The purpose of this study is to provide readers, developers, academics, and users interested in LLM-based chatbots and virtual intelligent assistant technologies with useful information and future directions.
Shahriar Golchin, M. Surdeanu
ArXiv
The best method achieves an accuracy between 92% and 100% in detecting if an LLM is contaminated with seven datasets, containing train and test/validation partitions, when contrasted with manual evaluation by human experts.
M. Treder, Sojin Lee, K. Tsvetanov
Frontiers in Dementia
Overall, this review corroborates the promising utilization of LLMs to positively impact dementia care by boosting cognitive abilities, enriching social interaction, and supporting caregivers.
Rajvardhan Patil, V. Gudivada
Applied Sciences
This paper extensively discusses different pretraining objectives, benchmarks, and transfer learning methods used in LLMs, and explores how LLMs can perform well across many domains and datasets if sufficiently trained on a large and diverse dataset.
Callie Y. Kim, Christine P. Lee, Bilge Mutlu
2024 19th ACM/IEEE International Conference on Human-Robot Interaction (HRI)
The findings show that LLM-powered robots elevate expectations for sophisticated non-verbal cues and excel in connection-building and deliberation, but fall short in logical communication and may induce anxiety.
Apurv Verma, Satyapriya Krishna, Sebastian Gehrmann + 7 more
ArXiv
A detailed threat model and systematization of knowledge of red-teaming attacks on LLMs are presented and a taxonomy of attacks based on the stages of the LLM development and deployment process is developed to improve the security and robustness of LLM-based systems.
Shuming Ma, Hongyu Wang, Lingxiao Ma + 7 more
ArXiv
This work introduces a 1-bit LLM variant, namely BitNet b1.58, in which every single parameter of the LLM is ternary, which defines a new scaling law and recipe for training new generations of LLMs that are both high-performance and cost-effective.
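The ternary weight format can be illustrated with absmean quantization, a minimal sketch in the spirit of the 1.58-bit recipe (inference-style rounding only, not the paper's training procedure):

```python
import torch

def ternary_quantize(w: torch.Tensor, eps: float = 1e-5):
    """Scale by the mean absolute weight, then round and clip to {-1, 0, +1}."""
    gamma = w.abs().mean()
    w_q = (w / (gamma + eps)).round().clamp_(-1, 1)
    return w_q, gamma        # gamma is kept so outputs can be rescaled

w = torch.randn(1024, 1024)
w_q, gamma = ternary_quantize(w)
print(sorted(w_q.unique().tolist()))   # [-1.0, 0.0, 1.0]
```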
Keivan Alizadeh-Vahid, Iman Mirzadeh, Dmitry Belenko + 5 more
ArXiv
The integration of sparsity awareness, context-adaptive loading, and a hardware-oriented design paves the way for effective inference of LLMs on devices with limited memory.
Mahi Kolla, Siddharth Salunkhe, Eshwar Chandrasekharan + 1 more
Extended Abstracts of the CHI Conference on Human Factors in Computing Systems
This work explores the feasibility of using large language models (LLMs) to identify rule violations on Reddit and examines how an LLM-based moderator (LLM-Mod) reasons about 744 posts across 9 subreddits that violate different types of rules.
Zhikai Chen, Haitao Mao, Hongzhi Wen + 5 more
ArXiv
This work introduces LLM-GNN, a label-free node classification pipeline for graphs that amalgamates the strengths of both GNNs and LLMs while mitigating their limitations, leveraging the confidence scores derived from LLMs for advanced node selection.
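A toy sketch of the confidence-based node selection step (hypothetical values, not the LLM-GNN code): keep only LLM-annotated nodes whose confidence clears a threshold and use them as pseudo-labels for GNN training.

```python
llm_annotations = [                       # (node_id, predicted_label, confidence)
    (0, "physics", 0.95), (1, "biology", 0.40),
    (2, "physics", 0.88), (3, "cs", 0.62),
]

CONF_THRESHOLD = 0.8
train_nodes = [(n, y) for n, y, c in llm_annotations if c >= CONF_THRESHOLD]
print(train_nodes)   # [(0, 'physics'), (2, 'physics')] -> pseudo-label set for the GNN
```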
Jiaxin Zhang, Yiqi Wang, Xihong Yang + 6 more
ArXiv
This paper designs a novel Test-Time Training pipeline, LLMTTT, which conducts test-time adaptation using LLM annotations on a carefully selected node set and introduces a hybrid active node selection strategy that considers not only node diversity and representativeness, but also prediction signals from the pre-trained model.
Pinaki Raj
international journal of advanced research in computer science
This paper discusses the need for emotional intelligence in transformer-based LLMs and reviews the existing studies that have evaluated this aspect.
Jiahao Yu, Xingwei Lin, Zheng Yu + 1 more
journal unavailable
LLM-FUZZER, an automated solution for large-scale LLM jailbreak susceptibility assessment inspired by fuzz testing, generates additional jailbreak prompts tailored to specific LLMs; the evaluation highlights that many open-source and commercial LLMs suffer from severe jailbreak issues, even after safety fine-tuning.
Eleftheria Papageorgiou, Christos Chronis, Iraklis Varlamis + 1 more
Future Internet
The article delves into the capabilities of LLMs in generating both fake news and fake profiles, highlighting their dual role as both a tool for disinformation and a powerful means of detection.
Geng Sun, Yixian Wang, D. Niyato + 4 more
ArXiv
A novel framework of LLM-enabled graphs for networking optimization is proposed, and a case study on UAV networking is presented, concentrating on optimizing UAV trajectory and communication resource allocation to validate the effectiveness of the proposed framework.
Ali Maatouk, Kenny Chirino Ampudia, Rex Ying + 1 more
ArXiv
Tele-LLMs, the first series of language models ranging from 1B to 8B parameters specifically tailored for telecommunications, are developed and open-sourced; these models outperform their general-purpose counterparts on Tele-Eval while retaining their previously acquired capabilities, thus avoiding catastrophic forgetting.
Tiankai Yang, Yi Nian, Shawn Li + 9 more
ArXiv
AD-LLM is introduced, the first benchmark that evaluates how LLMs can help with NLP anomaly detection and finds that LLMs can work well in zero-shot AD, that carefully designed augmentation methods are useful, and that explaining model selection for specific datasets remains challenging.
Dong Shu, Tianle Chen, Mingyu Jin + 4 more
ArXiv
The Knowledge Graph Large Language Model (KG-LLM) is introduced, a novel framework that leverages large language models (LLMs) for knowledge graph tasks and significantly improves the models' generalization capabilities, leading to more accurate predictions in unfamiliar scenarios.
Hanjia Lyu, Song Jiang, Hanqing Zeng + 2 more
journal unavailable
This study introduces a novel approach, coined LLM-Rec, which incorporates four distinct prompting strategies of text enrichment for improving personalized text-based recommendations, and empirical experiments reveal that using LLM-augmented text significantly enhances recommendation quality.