Open source frameworks for building AI like ChatGPT and Bard



ChatGPT is built on LLM and Framework as we all know. In today's topic, we will introduce eight open source frameworks and models that you can use to build your own chatbot or AI product.

1. LLaMA

The LLaMA project is based on the Fundamental Language Model. which has 7 billion to 65 billion parameters. These models are trained on millions of tokens and publicly available datasets. As a result, LLaMA-13B outperforms GPT-3 (175B) and can show similar performance to models like LLaMA-65B, Chinchilla-70B and PaLM-540B.

GitHub: facebookresearch/llama

Demo:  Baize Lora 7B

2. Alpaca

Stanford Alpaca claims they can compete with ChatGPT and anyone can build their own ChatGPT for less than $600. The Alpaca 7B is fine-tuned from the LLaMA 7B model based on 52,000 instructions.

GitHub: tatsu-lab/stanford_alpaca

Demo: Alpaca-LoRA

3. Vicuna

Vicuna is fine-tuned from the LLaMA model, based on user shared conversations collected from ShareGPT. Vicuna-13B has already achieved 90% quality efficiency from OpenAI's ChatGPT and Google Bard. It outperforms the LLaMA and Stanford Alpaca models in about 90% of cases. Vicuna cost about $300 to train.

GitHub: lm-sys/FastChat

Demo: FastChat (lmsys.org)

4. OpenChatKit

OpenChatKit is an open source ChatGPT alternative complete toolkit with which you can build your own chatbot. It will provide instructions for training and fine-tuning your model. OpenChatKit will help you create chatbots after all. The GPT-NeoXT-Chat-Base-20B model already outperforms GPT-NoeX in query-response.

GitHub: togethercomputer/OpenChatKit

Demo: OpenChatKit 

Model card: togethercomputer/GPT-NeoXT-Chat-Base-20B

5. GPT4ALL

GPT4ALL is a community driven project trained on massive data such as code, assistant interactions, stories, etc. The team behind this model has made their dataset, model volume, data curation process, training code all open source. They also released a 4 bit version of this model that you can run on a laptop. You can also run this model in a Python client if you want.

GitHub: nomic-ai/gpt4al

Demo: GPT4All

Model card: nomic-ai/gpt4all-lora · Hugging Face

6. Raven RWKV

Raven RWKV 7B is an open source chatbot built on the RWKV language model that can produce results similar to ChatGPT. This model uses RNNs that match Transformers in terms of quality and scaling, and saves a lot of VRAM. Raven, fine-tuned from Stanford Alpaca, code-alpaca, and many other datasets.

GitHub: BlinkDL/ChatRWKV

Demo:  Raven RWKV 7B

Model card: BlinkDL/rwkv-4-raven

7. OPT

OPT or Open Pre-trained Transformer language model is not better than ChatGPT but it has shown great performance in Stereotypical Bias Analysis. You can integrate Alpa, Colossal-AI, CTranslate2, and FasterTransformer with it to get even better results.

It is on this list because its monthly downloads are like 624710 in text generation category.

GitHub: facebookresearch/metaseq

Demo: A Watermark for LLMs

Model card: facebook/opt-1.3b

8. Flan-T5-XXL

The Flan-T5-XXL is a finetune T5 model. Here is a kind of instruction based fine tuning. Such instructional fine tuning has dramatically increased the performance of models such as PaLM, T5, and U-PaLM. The Flan-T5-XXL model has also been fine-tuned with over 1,000 additional tasks that further enhance its performance.

GitHub: google-research/t5x

Demo: Chat Llm Streaming

Model card: google/flan-t5-xxl

last word

There are so many open source options available on the internet that I tried to highlight some of the best. These open source chatbots or models will get better in the next few months then it might even overtake ChatGPT in terms of performance.

So today until the next topic stay well.

Next Post Previous Post
No Comment
Add Comment
comment url