A close look at Apple's analysis of LLM reasoning across problem complexity, which reveals limitations in current benchmark design and model interpretability.
llms · Research, Apple, Reasoning, Language Models
Anthropic launched Haiku, Sonnet, and Opus under the Claude 3 family, offering powerful capabilities across tasks.
llms · Anthropic, Claude, Language Models
Google’s next-gen Gemini 1.5 Pro uses a Mixture-of-Experts architecture to match Ultra performance at lower compute, with a breakthrough 1M-token context window.
llms · Google, Gemini, Language Models
Gemini 1.5 Pro featured industry-leading long-context capabilities, enabling advanced reasoning over vast documents.
llms · Google, Gemini, Language Models
The first Gemini model from Google DeepMind, merging strengths from AlphaCode, Pathways, and large-scale training.
llms · Google, Gemini, Language Models
An optimized and cost-efficient variant of GPT-4 powering ChatGPT with custom GPTs, tools, and longer context.
llms · OpenAI, GPT, Language Models
An improved Claude model with stronger reasoning, fewer hallucinations, and broader public availability.
llms · Anthropic, Claude, Language Models
A multimodal leap forward for OpenAI, able to reason over images and text with greater nuance.
llms · OpenAI, GPT, Language Models
The 175B-parameter model that revolutionized natural language interfaces and powered the first wave of AI API tools.
llms · OpenAI, GPT, Language Models
GPT-2 demonstrated surprisingly coherent text generation, sparking debate over AI safety and open-sourcing.
llms · OpenAI, GPT, Language Models
OpenAI's first generative pre-trained transformer model, laying the foundation for large language models.
llms · OpenAI, GPT, Language Models