Hugging Face, a startup business in artificial intelligence (AI), has expanded its repertoire of services with an array of development tools tailored for data science applications, offering a shared platform similar to GitHub that is focused on AI.
This platform includes an assortment of code repositories, data collections, and models, as well as interfaces that demonstrate AI-driven applications.
A small yet remarkably effective team within the company, consisting of just two individuals who came together at the start of the year, is now making waves. This duo, known as H4—which stands for "helpful, honest, harmless, and huggy"—has its sights set on empowering the AI community. Their objective is to facilitate the creation of chatbots with capabilities akin to those of the widely recognized ChatGPT.
Following the launch of ChatGPT, H4 was established with the intent of exploring the potential of utilizing open-source tools and libraries to mimic similar functionalities, as shared by Lewis Tunstall, a machine learning engineer at Hugging Face and one of H4's members.
Expanding the AI Community's Toolbox
H4 has taken the lead on several new open-source language models. Their projects include Zephyr-7B-α, a specialized version of the Mistral 7B model designed for conversations, and a modification of the Falcon-40B model to better handle natural language requests.
Training these advanced models, H4 leverages a robust infrastructure, utilizing a powerful cluster of Nvidia A100 graphics processing units. Despite the geographical distance, with team members operating remotely in Europe, they receive comprehensive support from other Hugging Face teams specializing in model evaluation and testing.
H4 has opted for a compact team size, allowing for agility and adaptability in their research endeavors. Their work includes collaborating with external partners on joint project releases. Presently, they are delving into various techniques to align AI behaviors with human feedback, advancing the community's understanding of such methodologies.
Recently, H4 made their handbook public, which contains the source code and data they utilized for their Zephyr model, and they intend to continue sharing their innovations, maintaining transparency, and fostering collective progress within the AI field.
While H4's work does not generate direct revenue, it does contribute to Hugging Face's broader business objectives, such as the Expert Acceleration Program that aids enterprises in developing tailor-made AI solutions.
Despite the potential competitive landscape with other open-source AI initiatives, H4 maintains that its goal isn't to compete but to enrich the AI community by offering access to the core resources of its chat models.
Photo: Mohamed Nohassi