Athene-Llama3-70B Released: An Open-Weight LLM Trained through RLHF Based on Llama-3-70B-Instruct


Nexusflow has released Athene-Llama3-70B, an open-weight chat model fine-tuned from Meta AI’s Llama-3-70B. Athene-70B has achieved an Arena-Hard-Auto score of 77.8%, rivaling proprietary models like GPT-4o and Claude-3.5-Sonnet. This marks a significant improvement over its predecessor, Llama-3-70B-Instruct, which scored 46.6%. The improvement stems from Nexusflow’s targeted post-training pipeline, designed to improve specific model behaviors. Athene-70B is currently undergoing public testing on Chatbot Arena.

To maximize Llama-3-70B’s potential, Nexusflow developed internal benchmarks evaluating LLM capabilities in instruction following, coding, creative writing, and multilingual tasks. Based on these evaluations, high-quality preference data was curated for targeted Reinforcement Learning from Human Feedback (RLHF). This pipeline resulted in substantial performance improvements compared to Llama-3-70B-Instruct. The improvements span key aspects such as precise instruction following, math and reasoning, comprehensive coding assistance, inspired creative writing, and multilingual mastery.

Athene-70B demonstrates Nexusflow’s capability to customize models for specific enterprise requirements through targeted post-training. Building on earlier successes with Starling-7B and NexusRaven-V2, Nexusflow aims to advance its models to meet enterprise-grade application standards. The company offers tailored solutions to help businesses excel in GenAI copilot and agent technologies. Nexusflow invites organizations to explore how Athene-70B can enhance their AI initiatives by contacting the company for further information and collaboration opportunities.

Athene-Llama3-70B, an open-weight chat model developed by Nexusflow, demonstrates significant improvements over its predecessor. The model achieves competitive performance compared to proprietary models on the Arena-Hard-Auto benchmark. Nexusflow’s targeted post-training pipeline, using internal benchmarks and Reinforcement Learning from Human Feedback, has enhanced the model’s capabilities across various domains, including instruction following, math and reasoning, coding, creative writing, and multilingual tasks. This advancement showcases Nexusflow’s ability to tailor models for enterprise needs, building on its earlier successes. The company positions itself as a provider of customized enterprise-grade AI solutions, inviting organizations to explore the potential of Athene-70B for their AI initiatives.


Check out the Model Card. All credit for this research goes to the researchers of this project.



Asjad is an intern consultant at Marktechpost. He is pursuing a B.Tech in mechanical engineering at the Indian Institute of Technology, Kharagpur. Asjad is a machine learning and deep learning enthusiast who is always researching the applications of machine learning in healthcare.


