From business processes to scientific research, AI agents can process huge datasets, streamline workflows, and support decision-making. Yet, even with all these developments, building and tailoring LLM agents is still a daunting task for most users. The main reason is that AI agent platforms require programming skills, restricting access to a mere fraction of…
Visual programming has emerged strongly in computer vision and AI, especially for image reasoning. It enables systems to generate executable code that interacts with visual content to produce correct responses, and it underpins applications such as object detection, image captioning, and visual question answering (VQA). Its effectiveness stems from the ability to modularize multiple reasoning tasks,…
Deep learning faces difficulties when applied to large physical systems on irregular grids, especially when interactions occur over long distances or at multiple scales. Handling these complexities becomes harder as the number of nodes grows, and existing techniques struggle to scale to such problems, incurring high computational costs and inefficiency. Some major issues are capturing…
Transformer models have reshaped language modeling by enabling large-scale text generation with emergent properties. However, they struggle with tasks that require extensive planning. Researchers have explored modifications to architectures, training objectives, and algorithms to improve their ability to achieve goals. Some approaches move beyond traditional left-to-right sequence modeling by incorporating bidirectional context, as seen in models…
Large language models have made significant strides in understanding and generating human-like text. Yet, when it comes to complex reasoning tasks—especially those that require multi-step calculations or logical analysis—they often struggle. Traditional chain-of-thought (CoT) approaches help by breaking down problems into intermediate steps, but they rely heavily on the model’s internal reasoning. This internal dependency…
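One way to reduce this reliance on internal reasoning is to have the model emit the calculation step as executable code and run it externally. The sketch below is only an illustration of that general idea, not the specific method the excerpt goes on to describe; the word problem and the hard-coded "model output" are invented for the example.

```python
# Illustrative program-aided reasoning: instead of trusting the model's
# internal arithmetic, the calculation is expressed as code and executed.
# The "model output" below is hard-coded; in practice it comes from an LLM.

def solve_with_code(generated_program: str):
    """Run a model-generated snippet and return its `answer` variable."""
    namespace = {}
    exec(generated_program, {"__builtins__": {}}, namespace)  # restricted exec
    return namespace["answer"]

# A plausible model response to: "A store sells 17 boxes of 24 pencils each,
# then 55 pencils are returned. How many pencils were sold in total?"
model_output = "answer = 17 * 24 - 55"
print(solve_with_code(model_output))  # 353
```

The exact arithmetic is done by the interpreter, so a multi-step calculation cannot be silently mangled the way a purely internal chain-of-thought step can.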
In this tutorial, we will look at how to perform sentiment analysis on text data using IBM’s open-source Granite 3B model integrated with Hugging Face Transformers. Sentiment analysis, a widely used natural language processing (NLP) technique, helps quickly identify the emotions expressed in text. This makes it invaluable for businesses aiming to understand customer feedback…
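A minimal sketch of that workflow might look like the following. The checkpoint id and prompt template here are assumptions for illustration, not details taken from the tutorial itself; substitute the Granite model id the article actually uses.

```python
# Sketch of instruction-based sentiment analysis with Hugging Face Transformers.
# MODEL_ID is an assumed Granite checkpoint, not confirmed by the tutorial.

MODEL_ID = "ibm-granite/granite-3.0-2b-instruct"  # assumption; swap as needed

def build_prompt(text: str) -> str:
    """Wrap input text in a plain sentiment-classification instruction."""
    return (
        "Classify the sentiment of the following text as Positive, "
        f"Negative, or Neutral.\n\nText: {text}\nSentiment:"
    )

def parse_sentiment(completion: str) -> str:
    """Pick out the first sentiment label appearing in the model's reply."""
    lowered = completion.lower()
    for label in ("positive", "negative", "neutral"):
        if label in lowered:
            return label.capitalize()
    return "Unknown"

if __name__ == "__main__":
    # Loading a ~3B-parameter model needs a GPU or ample RAM.
    from transformers import pipeline
    generator = pipeline("text-generation", model=MODEL_ID)
    prompt = build_prompt("The support team resolved my issue quickly!")
    full = generator(prompt, max_new_tokens=8)[0]["generated_text"]
    print(parse_sentiment(full[len(prompt):]))  # label parsed from reply only
```

Slicing off the prompt before parsing matters: the instruction itself contains the label words, so parsing the full generated text would always match "Positive".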
Large Language Models (LLMs) have significantly advanced due to the Transformer architecture, with recent models like Gemini 1.5 Pro, Claude 3, GPT-4, and Llama 3.1 demonstrating capabilities to process hundreds of thousands of tokens. However, these expanded context lengths introduce critical challenges for practical deployment. As sequence length increases, decoding latency escalates and memory constraints become severe bottlenecks. The…
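To make the memory bottleneck concrete, here is a back-of-the-envelope estimate of the KV-cache size that decoding must keep resident. The layer and head counts are an assumed Llama-style configuration for illustration, not the specs of any model named above.

```python
# Rough KV-cache size: keys and values are each cached per layer, per KV head,
# per position. Configuration below is illustrative, not a real model's spec.

def kv_cache_bytes(layers, kv_heads, head_dim, seq_len, batch=1, bytes_per_elem=2):
    """Bytes held by the KV cache for one decoding batch (fp16 by default)."""
    return 2 * layers * kv_heads * head_dim * seq_len * batch * bytes_per_elem

# 32 layers, 8 KV heads (grouped-query attention), head_dim 128, 128K context.
gib = kv_cache_bytes(32, 8, 128, seq_len=128_000) / 2**30
print(f"{gib:.1f} GiB")  # 15.6 GiB
```

Even with grouped-query attention shrinking the KV head count, a single 128K-token sequence occupies on the order of 16 GiB, which is why long-context decoding quickly exhausts accelerator memory.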
Running large language models (LLMs) presents significant challenges due to their hardware demands, but numerous options exist to make these powerful tools accessible. Today’s landscape offers several approaches – from consuming models through APIs provided by major players like OpenAI and Anthropic, to deploying open-source alternatives via platforms such as Hugging Face and Ollama. Whether…
In today’s rapidly evolving digital landscape, the need for accessible, efficient language models is increasingly evident. Traditional large-scale models have advanced natural language understanding and generation considerably, yet they often remain out of reach for many researchers and smaller organizations. High training costs, proprietary restrictions, and a lack of transparency can hinder innovation and limit…
This paper was just accepted at CVPR 2025. In short, CASS is an elegant solution to object-level context in open-world segmentation. It outperforms several training-free approaches and even surpasses some methods that rely on extra training. The gains are especially notable in challenging setups where objects have intricate sub-parts or classes have high visual…