AI News — Страница 76

This AI Paper Introduces Group Think: A Token-Level Multi-Agent Reasoning Paradigm for Faster and Collaborative LLM Inference

24 мая, 2025

This AI Paper Introduces Group Think: A Token-Level Multi-Agent Reasoning Paradigm for Faster and Collaborative LLM Inference A prominent area of exploration involves enabling large language models (LLMs) to function collaboratively. Multi-agent systems powered by LLMs are now being examined for their potential to coordinate challenging problems by splitting tasks and working simultaneously. This direction…

Read more →

Evaluating Enterprise-Grade AI Assistants: A Benchmark for Complex, Voice-Driven Workflows

24 мая, 2025

Evaluating Enterprise-Grade AI Assistants: A Benchmark for Complex, Voice-Driven Workflows As businesses increasingly integrate AI assistants, assessing their performance in real-world tasks—especially through voice interactions—is essential. Existing evaluation methods tend to focus on broad conversational skills or limited, task-specific tool usage, which do not adequately measure an AI agent’s ability to manage complex, specialized workflows…

Read more →

Researchers from the National University of Singapore Introduce ‘Thinkless,’ an Adaptive Framework that Reduces Unnecessary Reasoning by up to 90% Using DeGRPO

23 мая, 2025

Introducing ‘Thinkless’: An Adaptive Framework for Efficient Language Model Reasoning Researchers from the National University of Singapore have developed a new framework known as Thinkless, designed to enhance the efficiency of language models by reducing unnecessary reasoning by up to 90%. This framework addresses a core challenge in current language models, where extensive reasoning processes…

Read more →

Researchers Introduce MMLONGBENCH: A Comprehensive Benchmark for Long-Context Vision-Language Models

23 мая, 2025

Researchers Introduce MMLONGBENCH: A Comprehensive Benchmark for Long-Context Vision-Language Models Recent advances in long-context (LC) modeling have significantly enhanced the capabilities of large language models (LLMs) and large vision-language models (LVLMs). Long-context vision-language models (LCVLMs) are now capable of processing hundreds of images and thousands of interleaved text tokens in a single forward pass. However,…

Read more →

Microsoft AI Introduces Magentic-UI: An Open-Source Agent Prototype that Works with People to Complete Complex Tasks that Require Multi-Step Planning and Browser Use

23 мая, 2025

Microsoft AI Introduces Magentic-UI: An Open-Source Agent Prototype Modern web usage encompasses various digital interactions, such as filling out forms, managing accounts, executing data queries, and navigating complex dashboards. Despite the web’s integration with productivity and work processes, many actions still require repetitive human input. This is particularly evident in environments needing detailed instructions or…

Read more →

Beyond Aha Moments: Structuring Reasoning in Large Language Models

22 мая, 2025

Beyond Aha Moments: Structuring Reasoning in Large Language Models Large Reasoning Models (LRMs) like OpenAI’s o1 and o3, DeepSeek-R1, Grok 3.5, and Gemini 2.5 Pro exhibit strong capabilities in long Chain of Thought (CoT) reasoning. These models often demonstrate advanced behaviors such as self-correction, backtracking, and verification, collectively referred to as “aha moments.” Such behaviors…

Read more →

Anthropic Releases Claude Opus 4 and Claude Sonnet 4: A Technical Leap in Reasoning, Coding, and AI Agent Design

22 мая, 2025

Anthropic Releases Claude Opus 4 and Claude Sonnet 4: A Technical Leap in Reasoning, Coding, and AI Agent Design Anthropic has announced the release of its next-generation language models: Claude Opus 4 and Claude Sonnet 4. This update marks significant technical refinements in the Claude model family, particularly in structured reasoning, software engineering, and autonomous…

Read more →

Technology Innovation Institute TII Releases Falcon-H1: Hybrid Transformer-SSM Language Models for Scalable, Multilingual, and Long-Context Understanding

22 мая, 2025

Technology Innovation Institute TII Releases Falcon-H1: Hybrid Transformer-SSM Language Models for Scalable, Multilingual, and Long-Context Understanding As language models scale, balancing expressivity, efficiency, and adaptability becomes increasingly challenging. Transformer architectures dominate due to their strong performance across a wide range of tasks but are computationally expensive—particularly for long-context scenarios—due to the quadratic complexity of self-attention.…

Read more →

This AI Paper Introduces MathCoder-VL and FigCodifier: Advancing Multimodal Mathematical Reasoning with Vision-to-Code Alignment

22 мая, 2025

«`html Advancing Multimodal Mathematical Reasoning with Vision-to-Code Alignment Multimodal mathematical reasoning enables machines to solve problems involving both textual information and visual components such as diagrams and figures. This capability is essential in education, automated tutoring, and document analysis, where problems are often presented with a combination of text and images. A significant challenge in…

Read more →

Google DeepMind Releases Gemma 3n: A Compact, High-Efficiency Multimodal AI Model for Real-Time On-Device Use

22 мая, 2025

Google DeepMind Releases Gemma 3n: A Compact, High-Efficiency Multimodal AI Model for Real-Time On-Device Use As demand for faster, smarter, and more private AI on mobile devices grows, researchers are reimagining how AI models operate. The next generation of AI is not only lighter and faster but also designed for local deployment. By embedding intelligence…

Read more →