Skip to main content

Large Language Models (LLMs)

Open Source Large Language Models (LLMs)

This blog post will explore open source LLMs that are commercially viable.

  • Open-source LLMs are large language models that are available for anyone to use, modify and distribute.
  • This means that businesses and developers can use these models without having to pay licensing fees or worry about usage restrictions.


There are a number of benefits to using open-source LLMs:

  • Cost-effective. Open-source LLMs are typically free to use, which can save businesses a significant amount of money.
  • Customisable. Businesses can modify open-source LLMs to meet their specific needs. This can be useful for tasks such as fine-tuning the model for a particular domain or adding new features.
  • Transparent. The code for open-source LLMs is publicly available, which means that businesses can see how the model works and make sure that it is aligned with their values.

However, it's crucial to understand the limitations and ethical considerations associated with using LLMs.

A List of Open LLMs Available for Commercial Use and their applications

  • T5 (2019): A text-to-text transformer model that can be used for various tasks, such as translation, summarisation, and question answering.
  • RWKV 4 (2021): Based on a recurrent neural network (RNN) architecture, this model can handle infinite context length, making it suitable for long-form text generation.
  • Bloom (2022): A multilingual LLM with 176 billion parameters trained on a massive dataset of text and code. Bloom can perform various language-related tasks and generate different creative text formats.
  • ChatGLM (2023): A 6-billion parameter model designed specifically for chatbot applications. It is known for its ability to engage in human-like conversations.
  • Dolly (2023): This model was the world's first truly open instruction-tuned LLM. It excels at following instructions and completing tasks as instructed.
  • StableLM-Alpha (2023): A model from Stability AI trained on a large dataset of text and code. StableLM-Alpha is designed for stability and reliability in text generation.
  • MPT-7B (2023): This model sets a new standard for open-source, commercially usable LLMs. It is known for its long context length (84k) and suitability for various applications.
  • Falcon (2023): Trained on a massive web dataset, the Falcon model series includes models with different parameter sizes, suitable for a wide range of applications.
  • LLaMA 2 (2023): Released by Meta, this model series offers fine-tuned chat models and includes models with parameter sizes ranging from 7 billion to 70 billion. LLaMA 2 is known for its performance and customisable nature.
  • Mistral 7B (2023): A powerful 7-billion parameter model that utilises a sliding window approach to handle context lengths up to 16k. Mistral 7B is recognised for its efficiency and performance.
  • SantaCoder (2023): Designed for code generation, SantaCoder can assist developers in writing and understanding code.
  • CodeGen2 (2023): Another code-focused LLM trained on programming and natural language data, making it proficient in code understanding and generation tasks.
  • StarCoder (2023): A state-of-the-art LLM trained on a large dataset of code designed to be a powerful coding assistant.
  • Code Llama (2023): This code-specialised model from Meta is available in various sizes, catering to different computational resources and use cases. It can generate different types of code and assist with programming tasks.

Open-source LLMs offer many benefits for businesses and developers. They are cost-effective, customisable, and transparent. With the growing number of open-source LLMs available, businesses are increasingly adopting this technology.

If you are interested in learning more about open-source LLMs, I encourage you to check out the sources [1] mentioned in this blog post. You can find a wealth of information on the GitHub repository, 'open-llms.'

Reference: 

[1] https://github.com/eugeneyan/open-llms


Hashtags:

#OpenSource, #LLMs, #TechBlogs, #FreeAIModels

Comments

Popular posts from this blog

BIG DATA ANALYTICS

BIG DATA ANALYTICS Have you ever hit upon how Amazon and Flip kart could possible verdict what we want; how the Google auto completes our search; how the YouTube looks into videos we want to watch? When we open YouTube, we will be at sixes and sevens, when we find ads related to what we have searched earlier in the past days. This is where we find ourselves in the era of big data analytics. More than 3 trillion bytes of information are being generated everyday through our smart phones, tablets, GPS devices, etc.  Have we thought about what can be done with all these information? This is where the data analytics comes into play. Big data analytics is just the study of future build up to store data in order to extract the behaviour patterns. The entire social networking website gathers our data which are related to our interest which is usually done by using our past search or any other social information. Data analytics will lead to a walkover in near future....

Amazon Q Developer Agents Can Now do more

Amazon Q Developer Agents Can Now do more than Helping You write Code Amazon Q Developer - Yes, yes that code generating assistant only. ⚡AWS just gave Amazon Q Developer a brain 🧠 boost. πŸ₯΄ Have you ever got bored of writing documents for the code you have written?🧐 πŸ₯Ί Asked senior dev for code review, which never happened because they are occupied with their own tasks? 🧐 I can hear you, saying 'Everytime' πŸ˜…. 🎟️ Now you can get some helping hand, from Amazon Q Developer Agents. πŸ€– This AI coding buddy can now write docs faster than you can say ' README.md ' πŸ” Review code like a caffeinated senior dev at 1 AM and throw out unit tests quicker than you can break the build. 🦸‍♂️ It's like having a super intern who never sleeps, doesn't drink all your coffee and won't steal your comfortable seating chair in office πŸ˜‰. 🐣Previously, ✏️(/dev) - can generate real time code suggestions based on your comments and existing code, bootstra...

A Conversation between Simba and Mufasa about AI

A Conversation between Simba 🐯 & Mufasa 🦁  about AI (Artificial Intelligence) Simba: Dad, have you heard about this thing called AI? Mufasa: Yes, my son. It’s a tool that can help us in many ways. Simba: But what if it takes over everything? What if one day, all the animals in the Pride Lands start asking AI to do their jobs? Mufasa: Simba, remember that while AI can assist us, it cannot replace the heart and spirit of the Circle of Life. Simba: So, you’re saying I shouldn’t worry? Mufasa: Exactly. Embrace AI as a friend, not a foe. It can help you hunt for ideas, but it can’t replace your instincts or your roar! Simba: So, I can still be the king, even with AI around? Mufasa: Of course! Just remember, the true strength of a king lies in his ability to adapt and grow, not just in what tools he uses. Simba: Thanks, Dad! I guess I’ll just have to learn to work with AI instead of worrying about it! Mufasa: That’s the spirit, my son. Now, let’s go find some lunch—AI can’t help ...