LLM (Large Language Model)
A neural network trained on huge amounts of text to predict and generate language.
Large language models are transformer neural networks trained on trillions of tokens of text and code. They learn statistical patterns of language deeply enough to reason, write, translate, summarize, and follow instructions.
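The core training objective is next-token prediction: estimate the probability of the next token given the preceding context. A toy sketch of that idea, using simple bigram counts instead of a transformer (the corpus and function names here are illustrative, not from any real model):

```python
from collections import Counter, defaultdict

# Toy illustration, not a real LLM: next-token prediction learned from
# co-occurrence statistics. Transformers optimize the same objective,
# but over trillions of tokens with billions of parameters.
corpus = "the cat sat on the mat the cat ate the fish".split()

# Count bigrams: how often each token follows each preceding token.
bigrams = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigrams[prev][nxt] += 1

def predict_next(token):
    """Return an estimate of P(next token | token) from the counts."""
    counts = bigrams[token]
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}

print(predict_next("the"))  # → {'cat': 0.5, 'mat': 0.25, 'fish': 0.25}
```

A real LLM replaces the count table with a neural network conditioned on thousands of prior tokens, but the output is the same kind of object: a probability distribution over the vocabulary, sampled from to generate text.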
The biggest commercial models in 2026 (Claude, GPT, Gemini) have hundreds of billions to trillions of parameters and context windows of hundreds of thousands of tokens. Open-weight families like Llama, Mistral, Qwen, and DeepSeek offer competitive alternatives that can be self-hosted.
Size is no longer the only thing that matters. Post-training (instruction tuning, RLHF, constitutional AI) and tool use have become as important as scale. The 'best' model depends on the task: reasoning vs. speed vs. cost vs. multimodality.