Oobabooga
From llamawiki.ai
== Features ==
* 3 interface modes: default, notebook, and chat
* Multiple model backends: [[transformers (library)|transformers]], [[llama.cpp]], [[Exllama|ExLlama]], [[AutoGPTQ]], [[GPTQ-for-LLaMa]]
* Dropdown menu for quickly switching between different models
* [[LoRA]] support: load and unload LoRAs on the fly, train a new LoRA
* Precise instruction templates for chat mode, including [[Llama 2]], [[Alpaca]], [[Vicuna]], [[WizardLM]], [[StableLM]], and many others
* Multimodal pipelines, including LLaVA and MiniGPT-4
* 8-bit and 4-bit inference through [[bitsandbytes]]
* CPU mode for transformers models
* DeepSpeed ZeRO-3 inference
* Extensions
* Custom chat characters
* Very efficient text streaming
* Markdown output with LaTeX rendering, to use for instance with GALACTICA
* Nice HTML output for GPT-4chan
* API, including endpoints for websocket streaming (see the examples)