Serge is a chat interface based on llama.cpp for running alpaca models. Fully self-hosted, no API key required. Fits in 4GB RAM and runs on CPU.
- SvelteKit front end
- Redis for storing chat records and parameters
- FastAPI + langchain for API, wrapping calls to llama.cpp with python bindings
use
Setting up Serge is very simple and can be started in one command:
docker run -d \
-v weights:/usr/src/app/weights -v datadb:/data/db/ \
-p 8008:8008 ghcr.io/nsarrazin/serge:latest
Then just go to http://localhost:8008/ That’s it!
API documentation can be found at http://localhost:8008/api/docs turn up
#Serge #Homepage #Documentation #Downloads #Alpaca #Model #Chat #Interface #News Fast Delivery