Large_language_model Search Results

Large language model

A large language model (LLM) is a computational model capable of language generation or other natural language processing tasks. As language models, LLMs...

155 KB (13,360 words) - 05:59, 27 August 2024

Language model

A language model is a probabilistic model of a natural language. In 1980, the first significant statistical language model was proposed, and during the...

14 KB (2,233 words) - 12:33, 16 July 2024

Claude (language model)

Claude is a family of large language models developed by Anthropic. The first model was released in March 2023. Claude 3, released in March 2024, can...

12 KB (1,182 words) - 06:50, 20 August 2024

Llama (language model)

Llama (acronym for Large Language Model Meta AI, and formerly stylized as LLaMA) is a family of autoregressive large language models (LLMs) released by...

33 KB (3,395 words) - 10:35, 22 August 2024

BLOOM (language model)

Large Open-science Open-access Multilingual Language Model (BLOOM) is a 176-billion-parameter transformer-based autoregressive large language model (LLM)...

4 KB (500 words) - 23:41, 12 August 2024

Chinchilla (language model)

Chinchilla is a family of large language models developed by the research team at DeepMind, presented in March 2022. It is named "chinchilla" because...

7 KB (548 words) - 21:42, 7 August 2024

Gemini (language model)

Gemini is a family of multimodal large language models developed by Google DeepMind, serving as the successor to LaMDA and PaLM 2. Comprising Gemini Ultra...

44 KB (3,469 words) - 08:18, 31 August 2024

T5 (language model)

Transfer Transformer) is a series of large language models developed by Google AI. Introduced in 2019, T5 models are trained on a massive dataset of text...

13 KB (1,280 words) - 00:33, 23 August 2024

BERT (language model)

of the art models, and as an early example of large language model. As of 2020[update], BERT was a ubiquitous baseline in Natural Language Processing...

29 KB (3,257 words) - 15:47, 12 August 2024

Prompt engineering (redirect from In-context learning (natural language processing))

generative AI model. A prompt is natural language text describing the task that an AI should perform: a prompt for a text-to-text language model can be a query...

52 KB (5,785 words) - 20:38, 1 September 2024

Mistral AI (section Mistral Large 2)

in the AI sector. The company focuses on producing open source large language models, emphasizing the foundational importance of free and open-source...

21 KB (2,191 words) - 19:29, 24 August 2024

Foundation model

A foundation model, also known as large AI model, is a machine learning or deep learning model that is trained on broad data such that it can be applied...

46 KB (5,057 words) - 15:00, 30 August 2024

Generative pre-trained transformer (redirect from GPT (language model))

Generative pre-trained transformers (GPTs) are a type of large language model (LLM) and a prominent framework for generative artificial intelligence. They...

47 KB (4,147 words) - 08:59, 15 August 2024

Transformer (deep learning architecture) (redirect from Transformer model)

Later variations have been widely adopted for training large language models (LLM) on large (language) datasets, such as the Wikipedia corpus and Common Crawl...

95 KB (11,902 words) - 02:56, 31 August 2024

Modeling language

and distributed systems. A large number of modeling languages appear in the literature. Example of graphical modeling languages in the field of computer...

22 KB (2,837 words) - 08:09, 12 July 2024

GPT-3 (redirect from GPT-3 (language model))

Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network...

54 KB (4,914 words) - 20:01, 1 September 2024

Jais (language model)

open-source large language model developed in the United Arab Emirates and launched in August 2023. It was trained on both English- and Arabic-language data...

3 KB (260 words) - 10:33, 19 June 2024

PaLM (redirect from Pathways Language Model)

PaLM (Pathways Language Model) is a 540 billion parameter transformer-based large language model developed by Google AI. Researchers also trained smaller...

12 KB (798 words) - 21:44, 30 June 2024

Open-source artificial intelligence (section Large language models)

development. LLaMA is a family of large language models released by Meta AI starting in February 2023. Meta claims these models are open-source software, but...

6 KB (505 words) - 03:04, 16 August 2024

Stochastic parrot (redirect from On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?)

describe the theory that large language models, though able to generate plausible language, do not understand the meaning of the language they process. The term...

22 KB (2,437 words) - 15:41, 1 September 2024

Model collapse

found. In the context of large language models, research found that training LLMs on predecessor-generated text—language models are trained on the synthetic...

15 KB (2,328 words) - 12:02, 31 August 2024

GPT-4o (category Large language models)

under different names on Large Model Systems Organization's (LMSYS) Chatbot Arena as three different models. These three models were called gpt2-chatbot...

17 KB (1,764 words) - 13:38, 1 September 2024

Perplexity.ai

works on a freemium model; the free product uses the company's standalone large language model (LLM) that incorporates natural language processing (NLP)...

14 KB (1,216 words) - 11:35, 20 August 2024

ChatGPT (category Large language models)

developed by OpenAI and launched on November 30, 2022. Based on large language models (LLMs), it enables users to refine and steer a conversation towards...

195 KB (16,855 words) - 15:25, 1 September 2024

Multimodal learning (redirect from Multimodal model)

(2023-01-01). "BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models". arXiv:2301.12597 [cs.CV]. Alayrac...

7 KB (1,943 words) - 14:24, 1 June 2024

Retrieval-augmented generation (category Large language models)

information retrieval process. It modifies interactions with a large language model (LLM) so that the model responds to user queries with reference to a specified...

11 KB (1,097 words) - 05:38, 29 August 2024

Vector database

semantic search, multi-modal search, recommendations engines, large language models (LLMs), object detection, etc. Vector databases are also often used...

20 KB (1,510 words) - 21:56, 1 September 2024

Microsoft Copilot (section Languages)

artificial intelligence chatbot developed by Microsoft. Based on a large language model, it was launched in February 2023 as Microsoft's primary replacement...

53 KB (4,807 words) - 13:42, 29 August 2024

GPT-4 (category Large language models)

Transformer 4 (GPT-4) is a multimodal large language model created by OpenAI, and the fourth in its series of GPT foundation models. It was launched on March 14...

61 KB (5,899 words) - 17:49, 28 August 2024

Fine-tuning (deep learning) (section Natural language processing)

natural language processing (NLP), especially in the domain of language modeling. Large language models like OpenAI's series of GPT foundation models can...

13 KB (1,370 words) - 20:36, 22 July 2024