Skip to content

jayanti-prasad/llm

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

Large Language Models

What Is It?

I plan to use this repository as a centralized hub for all the activities I intend to undertake concerning Large Language Models (LLM). Specifically, I intend to share the following items here:

  • Significant papers related to LLMs
  • Open-source datasets
  • Open-source models
  • Important projects
  • Jupyter notebooks

Why Is There So Much Hype About LLMs?

First Step towards General Artificial Intelligence (GAI)

It has been claimed that LLMs bring us very close to achieving General Artificial Intelligence (GAI) – potentially on par with or even superior to human intelligence! While language is indeed a vital tool in our cognitive toolbox, verbal and textual communication does not fully encompass the breadth of our communicative capabilities. Moreover, the language we employ in communication is fraught with subjectivity, variability, and arbitrariness.

Emergent Behavior!

There are assertions that when LLMs are trained on extensive datasets, they begin to exhibit surprising behaviors, analogous to phase transitions in physics! This is an extraordinary proposition! Essentially, this implies the potential for LLMs to develop consciousness, a characteristic traditionally considered emblematic of human identity.

Existential Threats!

I believe this concern has been somewhat exaggerated. Claims about the consequences of LLMs range from being as disruptive as nuclear wars to pandemics, alien attacks, or asteroid impacts!

Job Losses and Diminished Mental Abilities

This aspect is perhaps the easiest to comprehend and holds considerable truth. However, it is not dissimilar from the effects of past inventions such as the printing press, camera, or calculator. As people become increasingly reliant on tools like LLMs, certain abilities might diminish, much like how widespread use of mobile navigation systems diminished the importance of having a strong sense of direction.

Most important papers !

  1. [https://arxiv.org/abs/1409.0473] (Neural Machine Translation by Jointly Learning to Align and Translate)
  2. [https://arxiv.org/abs/1706.03762] (Attention Is All You Need)
  3. [https://arxiv.org/abs/1810.04805] (BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding)

<<<<<<< HEAD

LLM Review Papers

  1. [https://arxiv.org/abs/2307.03109](A Survey on Evaluation of Large Language Models)
  2. [https://arxiv.org/abs/2112.04359](Ethical and social risks of harm from Language Models)
  3. [https://arxiv.org/abs/2108.07258](On the Opportunities and Risks of Foundation Models)
  4. [https://arxiv.org/abs/2206.07682](Emergent Abilities of Large Language Models)
  5. [https://arxiv.org/abs/2303.18223](A Survey of Large Language Models)
  6. [https://arxiv.org/abs/2309.01029](Explainability for Large Language Models: A Survey)
  7. [https://arxiv.org/abs/2303.12712](Sparks of Arti cial General Intelligence:Early experiments with GPT-4)
  8. [https://arxiv.org/abs/2311.07361](The Impact of Large Language Models on Scientific Discovery:a Preliminary Study using GPT-4)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published