TECHNOLOGY
llama.cpp
llama.cpp is an open-source C and C++ inference runtime for running large language models across consumer and server hardware.
active
canonical
v1.0.0
Role
llama.cpp runs supported language models locally through a portable native runtime and quantized model formats.
Use
It is useful for private prototypes, offline inference, model evaluation and local knowledge tools where hosted APIs are not required.
References
See the official llama.cpp repository and Open-Weight Model.
Identity and publication
Record metadata
- Entity ID
- ea:technology:llama-cpp
- Publication class
- canonical
- Status
- active
- Maturity
- production
- Confidence
- canonical
- Published
- 2026-06-24
- Modified
- 2026-06-24
- Version
- 1.0.0
Citation
How to cite this record
llama.cpp. 1.0.0. Electronic Artefacts, 2026-06-24. https://electronicartefacts.com/knowledge/technologies/llama-cpp/
TYPED RELATIONSHIPS
How this entity connects.
Each connection has an explicit predicate and a human-readable statement.
implementation
Uses technology
Local and Open Source AI Systems uses llama.cpp as a reference local inference runtime.
Local graph
1 typed connections
The accessible relationship list above contains the complete local graph. Interactive rendering is loaded progressively.