In Short. Nvidia has unveiled the Nemotron 70B model, fine-tuned from Llama 3.1 70B using RLHF. Nvidia claims the model beats GPT-4o and Claude 3.5 Sonnet on LMSYS' Arena Hard benchmark, MT-Bench, and AlpacaEval. Nvidia also says Nemotron 70B can correctly answer the 'strawberry' question (how many times the letter "r" appears in "strawberry") without additional reasoning tokens or CoT prompting.