Large language models (LLMs) have revolutionized natural language processing (NLP), particularly for English and other data-rich languages. However, this rapid advancement has created a significant development gap for underrepresented languages, with Cantonese being a prime example. Despite being spoken by over 85 million people and holding economic importance in regions like the Guangdong-Hong Kong-Macau Greater…
Large Language Models (LLMs), initially limited to text-based processing, faced significant challenges in comprehending visual data. This limitation led to the development of Visual Language Models (VLMs), which integrate visual understanding with language processing. Early models like VisualGLM, built on architectures such as BLIP-2 and ChatGLM-6B, represented initial efforts in multi-modal integration. However, these models…
The competition to develop the most advanced Large Language Models (LLMs) has seen major advancements, with the four AI giants, OpenAI, Meta, Anthropic, and Google DeepMind, at the forefront. These LLMs are reshaping industries and significantly impacting the AI-powered applications we use daily, such as virtual assistants, customer support chatbots, and translation services. As competition…
Machine learning has made significant advancements, particularly through deep learning techniques. These advancements rely heavily on optimization algorithms to train large-scale models for various tasks, including language processing and image classification. At the core of this process lies the challenge of minimizing complex, non-convex loss functions. Optimization algorithms like Stochastic Gradient Descent (SGD) & its…
Together AI has introduced a groundbreaking technique known as TEAL (Training-Free Activation Sparsity in LLMs) that has the potential to advance the field of efficient machine learning model inference significantly. The company, a leader in open-source AI models, has been exploring innovative ways to optimize model performance, especially in environments with limited memory resources. TEAL…
LLMs like GPT-4, MedPaLM-2, and Med-Gemini perform well on medical benchmarks but need help to replicate physicians’ diagnostic abilities. Unlike doctors who gather patient information through structured questioning and examinations, LLMs often need more logical consistency and specialized knowledge, leading to inadequate diagnostic reasoning. Although they can assist in initial screenings by leveraging medical corpora,…
GNNs have excelled in analyzing structured data but face challenges with dynamic, temporal graphs. Traditional forecasting, often used in fields like economics and biology, relied on statistical models for time-series data. Deep learning, particularly GNNs, shifted focus to non-Euclidean data like social and biological networks. However, applying GNNs to dynamic graphs, where relationships constantly evolve,…
Neural Architecture Search (NAS) has emerged as a powerful tool for automating the design of neural network architectures, providing a clear advantage over manual design methods. It significantly reduces the time and expert effort required in architecture development. However, traditional NAS faces significant challenges as it depends on extensive computational resources, particularly GPUs, to navigate…
Microsoft addresses the complex challenges of integrating geospatial data into machine learning workflows. Working with such data is difficult due to its heterogeneity, coming in multiple formats and varying resolutions, and its complexity, involving features like occlusions, scale variations, and atmospheric interference. Additionally, geospatial datasets are large and computationally expensive to process, while a lack…
Adapting 2D-based segmentation models to effectively process and segment 3D data presents a significant challenge in the field of computer vision. Traditional approaches often struggle to preserve the inherent spatial relationships in 3D data, leading to inaccuracies in segmentation. This challenge is critical for advancing applications like autonomous driving, robotics, and virtual reality, where a…