Andrej Karpathy releases baby Llama

Share post:

Andrej Karpathy, former Tesla AI director, has released a simplified version of the Llama 2 language model that can run on a single computer.

The model, called “Baby Llama,” is based on the Llama 2 architecture but uses a much smaller number of parameters. This makes it possible to run the model on a laptop or other resource-constrained device.

The model can generate text at a rate of 100 tokens per second, which is significantly faster than other LLMs that can only run on GPUs and could lead to the development of new applications for LLMs, such as on-device language translation or chatbots.

Karpathy trained Baby Llama on the TinyStories dataset, which contains a collection of short stories. He reports that the model can generate text at a rate of 100 tokens per second. This is significantly faster than other LLMs that can only run on GPUs.

Karpathy’s work could also lead to the development of new applications for LLMs, such as on-device language translation or chatbots.

The sources for this piece include an article in AnalyticsIndiaMag.

SUBSCRIBE NOW

Related articles

Microsoft’s AI success may spell defeat for it’s climate goals

Microsoft's ambitious strides in AI technology are now posing a significant challenge to its own climate goals, as...

OpenAI’s Chief Scientist Ilya Sutskever Departs Company

Ilya Sutskever, co-founder and chief scientist of OpenAI, has officially announced his departure from the company. This move...

OpenAI snubs Microsoft, launching GPT-4o only on macOS

OpenAI, despite Microsoft's substantial $10 billion investment, has chosen to release its new ChatGPT app exclusively on macOS,...

Apple to integrate ChatGPT into iPhones

Apple Inc. is on the brink of solidifying a deal with OpenAI to integrate the ChatGPT technology into...

Become a member

New, Relevant Tech Stories. Our article selection is done by industry professionals. Our writers summarize them to give you the key takeaways