Node.js binding for llama.cpp.

llama.cpp: Inference of the LLaMA model in pure C/C++
```sh
npm install @fugood/llama.node
```
```js
import { loadModel } from '@fugood/llama.node'

// Initialize a Llama context with the model (may take a while)
const context = await loadModel({
  model: 'path/to/gguf/model',
  use_mlock: true,
  n_ctx: 2048,
  n_gpu_layers: 1, // > 0: enable GPU
  // embedding: true, // use embedding
  // lib_variant: 'opencl', // change backend
})

// Do completion
const { text } = await context.completion(
  {
    prompt:
      'This is a conversation between user and llama, a friendly chatbot. Respond in simple markdown.\n\nUser: Hello!\nLlama:',
    n_predict: 100,
    stop: ['</s>', 'Llama:', 'User:'],
    // n_threads: 4,
  },
  (data) => {
    // Partial completion callback, fired for each generated token
    const { token } = data
  },
)

console.log('Result:', text)
```
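The second argument to `completion` is invoked once per generated token, which makes it easy to stream output to a UI while the full result is still being produced. Below is a minimal sketch of that accumulation pattern; `fakeCompletion` is a hypothetical stand-in for `context.completion` (same callback shape as the example above), used only so the snippet runs without loading a model:

```js
// Stand-in for context.completion: calls the callback once per token,
// then resolves with the full text, mirroring the shape shown above.
async function fakeCompletion(params, onToken) {
  const tokens = ['Hello', ',', ' world', '!']
  for (const token of tokens) onToken({ token })
  return { text: tokens.join('') }
}

async function main() {
  let streamed = ''
  const { text } = await fakeCompletion(
    { prompt: 'Hi', n_predict: 16 },
    ({ token }) => {
      streamed += token // accumulate partial output for a streaming UI
    },
  )
  console.log(streamed === text) // prints true
}

main()
```

Once the promise settles, the string accumulated in the callback and the resolved `text` hold the same completion.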
Available `lib_variant` values:

- `default`: General usage; no GPU support except on macOS (Metal)
- `vulkan`: GPU support via Vulkan (Windows/Linux), but may be unstable in some scenarios
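When the context is created with `embedding: true` (commented out in the example above), the model produces embedding vectors instead of text. A common follow-up step is comparing two embeddings with cosine similarity; the helper below is plain JavaScript and assumes nothing about the binding's embedding API:

```js
// Cosine similarity between two equal-length embedding vectors:
// dot(a, b) / (|a| * |b|), in the range [-1, 1].
function cosineSimilarity(a, b) {
  let dot = 0
  let normA = 0
  let normB = 0
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i]
    normA += a[i] * a[i]
    normB += b[i] * b[i]
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB))
}

console.log(cosineSimilarity([1, 0], [1, 0])) // identical direction: 1
console.log(cosineSimilarity([1, 0], [0, 1])) // orthogonal: 0
```

Values near 1 indicate semantically similar texts, near 0 unrelated ones.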
License: MIT
Built and maintained by BRICKS.