Andrej Karpathy, former Tesla AI director, has released a simplified version of the Llama 2 language model that can run on a single computer.
The model, called “Baby Llama,” is based on the Llama 2 architecture but uses far fewer parameters, which makes it possible to run on a laptop or other resource-constrained device.
Karpathy trained Baby Llama on the TinyStories dataset, a collection of simple short stories. He reports that the model generates text at roughly 100 tokens per second on a laptop, a notable speed for a language model running on a CPU rather than the GPUs most LLMs require.

His work could lead to new applications for LLMs that run entirely on-device, such as local language translation or chatbots.
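Throughput figures like 100 tokens per second are typically obtained by timing a token-by-token generation loop. A minimal sketch in Python, where `generate_step` is a hypothetical stand-in for the model's per-token forward pass (not Karpathy's actual code):

```python
import time

def tokens_per_second(generate_step, n_tokens=100):
    """Time a generation loop of n_tokens steps and return tokens/sec."""
    start = time.perf_counter()
    for _ in range(n_tokens):
        generate_step()  # stand-in for producing one token
    elapsed = time.perf_counter() - start
    return n_tokens / elapsed

# Dummy step that sleeps 1 ms, standing in for real inference:
rate = tokens_per_second(lambda: time.sleep(0.001))
print(f"{rate:.0f} tokens/sec")
```

Real benchmarks would replace the dummy step with an actual model forward pass and sampling step; the timing structure is the same.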
The sources for this piece include an article in AnalyticsIndiaMag.