Inference of Meta's LLaMA model (and others) in pure C/C++ (source files)

llama_cpp_source-b4889-1-source

The main goal of llama.cpp is to enable LLM inference with minimal setup and state-of-the-art performance on a wide variety of hardware - locally and in the cloud.

* Plain C/C++ implementation without any dependencies
* AVX and AVX2 support for x86 architectures
* 1.5-bit, 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, and 8-bit integer quantization for faster inference and reduced memory use

Since its inception, the project has improved significantly thanks to many contributions. It is the main playground for developing new features for the ggml library.

Package name
llama_cpp_source
Repository
HaikuPorts
Repository source
haikuports_x86_64
Version
b4889-1
Download size
20.2 MB
Source available
No
Categories
No categories
Views of this version
0