Large Language Model

A large language model is a parameterized machine-learning model trained to predict and generate sequences of tokens from statistical patterns learned across large text or multimodal corpora.

Large language models combine tokenization, embeddings, transformer layers, attention, training objectives and probabilistic decoding to generate or transform language.

Definition

A large language model is a learned system that estimates relationships between tokens and uses those estimates to continue, transform or classify sequences. The word “large” usually refers to some combination of parameter count, training data and computational scale rather than a strict threshold.

Operating model

Text is segmented into tokens, mapped into numerical representations and processed through repeated transformer layers. Attention mechanisms let the model weigh relationships between positions in the available context. During inference, the model produces a probability distribution over possible next tokens and a decoding strategy selects from that distribution.

Scope

The concept includes pretraining, supervised adaptation, preference optimization, prompting, context windows, tool use, retrieval augmentation, quantization and evaluation. It does not imply that every model has the same architecture or that text generation alone establishes factual reliability.

Electronic Artefacts position

Electronic Artefacts treats the LLM as one component inside a wider knowledge or creative system. Sources, retrieval, permissions, tools, interfaces, provenance and human review determine whether the model is useful and governable.

Limitations

Language models can produce plausible but unsupported statements, inherit dataset bias, lose information outside their context, expose sensitive inputs and behave differently under small prompt changes. Their outputs require evaluation appropriate to the domain.

References

See Vaswani et al., Retrieval-Augmented Generation, Generative AI, Provenance and Augmented Intelligence.

Large Language Model

Definition

Operating model

Scope

Electronic Artefacts position

Limitations

References

Record metadata

How to cite this record

How this entity connects.

How Large Language Models Actually Work

1 typed connections