Llama.cpp
llama.cpp python library is a simple Python bindings for
@ggerganovllama.cpp.This package provides:
- Low-level access to C API via ctypes interface.
- High-level Python API for text completion
OpenAI-like APILangChaincompatibilityLlamaIndexcompatibility- OpenAI compatible web server
- Local Copilot replacement
- Function Calling support
- Vision API support
- Multiple Models
Overviewโ
Integration detailsโ
| Class | Package | Local | Serializable | JS support |
|---|---|---|---|---|
| ChatLlamaCpp | langchain-community | โ | โ | โ |
Model featuresโ
| Tool calling | Structured output | JSON mode | Image input | Audio input | Video input | Token-level streaming | Native async | Token usage | Logprobs |
|---|---|---|---|---|---|---|---|---|---|
| โ | โ | โ |