The rise of machine learning has driven advancements in many fields, including the arts and media. One such advancement is the development of text-to-image (T2I) generative networks, which can create detailed images from textual descriptions. These networks offer exciting opportunities for creators but also pose risks, such as the potential for generating harmful content. Currently,…
The Graph Mining team within Google Research has introduced TeraHAC to address the challenge of clustering extremely large datasets with hundreds of billions of data points, primarily focusing on trillion-edge graphs commonly used in tasks like prediction and information retrieval. Graph clustering algorithms merge similar items into groups for a better…
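To make the idea of merging similar items into groups concrete, here is a minimal sketch of plain average-linkage hierarchical agglomerative clustering on a small weighted graph. It is not TeraHAC and makes no attempt at scale or approximation; the function name `graph_hac`, the edge-dictionary input, and the stopping `threshold` are all illustrative assumptions, and missing edges are assumed to count as similarity zero.

```python
def graph_hac(num_nodes, edges, threshold):
    """Toy average-linkage HAC on a weighted graph (illustrative, not TeraHAC).

    edges maps (u, v) node pairs to a similarity weight. The two clusters with
    the highest average similarity are merged repeatedly until that average
    drops below `threshold`.
    """
    size = {i: 1 for i in range(num_nodes)}
    members = {i: [i] for i in range(num_nodes)}

    # Total edge weight between each pair of current clusters.
    weight = {}
    for (u, v), w in edges.items():
        key = (min(u, v), max(u, v))
        weight[key] = weight.get(key, 0.0) + w

    def average_similarity(pair):
        a, b = pair
        return weight[pair] / (size[a] * size[b])

    while weight:
        best = max(weight, key=average_similarity)
        if average_similarity(best) < threshold:
            break
        a, b = best
        # Merge cluster b into cluster a and re-route b's edges to a.
        merged = {}
        for (u, v), w in weight.items():
            if (u, v) == (a, b):
                continue
            u2 = a if u == b else u
            v2 = a if v == b else v
            key = (min(u2, v2), max(u2, v2))
            merged[key] = merged.get(key, 0.0) + w
        weight = merged
        size[a] += size.pop(b)
        members[a] += members.pop(b)

    return sorted(sorted(m) for m in members.values())
```

Given two tightly connected triangles joined by a single weak edge, a moderate threshold should keep the triangles as separate clusters, while a threshold of zero merges everything reachable into one group.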
Multimodal language models represent an emerging field in artificial intelligence that aims to enhance machine understanding of text and images. These models integrate visual and textual information to interpret and reason through complex data. Their capabilities span beyond simple text comprehension, pushing artificial intelligence toward more sophisticated realms where machine learning interacts seamlessly with the…
Natural language processing (NLP) focuses on enabling computers to understand and generate human language, making interactions more intuitive and efficient. Recent developments in this field have significantly impacted machine translation, chatbots, and automated text analysis. The need for machines to comprehend large amounts of text and provide accurate responses has led to the development of…
Contrastive learning typically matches pairs of related views among a number of unrelated negative views. Views can be generated (e.g. by augmentations) or be observed. We investigate matching when there are more than two related views which we call poly-view tasks, and derive new representation learning objectives using information maximization and sufficient statistics. We show…
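As an illustration of the standard two-view setting described above, the following is a minimal sketch of an InfoNCE-style loss, in which each anchor must pick out its related view among the other items' views acting as negatives. It is not the poly-view objective from the paper; the function name `info_nce`, the cosine-similarity scoring, and the `temperature` default are assumptions made for the example.

```python
import math


def _dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def _normalize(v):
    norm = math.sqrt(_dot(v, v))
    return [a / norm for a in v]


def info_nce(anchors, positives, temperature=0.1):
    """Mean InfoNCE loss over a batch of embedding pairs.

    positives[i] is the related view of anchors[i]; every positives[j]
    with j != i serves as an unrelated negative view for anchors[i].
    """
    a = [_normalize(v) for v in anchors]
    p = [_normalize(v) for v in positives]
    total = 0.0
    for i, anchor in enumerate(a):
        # Cosine similarity of this anchor to every candidate view.
        logits = [_dot(anchor, view) / temperature for view in p]
        # Log-sum-exp with max subtraction for numerical stability.
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
        # Cross-entropy with the matching view as the correct class.
        total += log_denominator - logits[i]
    return total / len(a)
```

The loss is small when each anchor is most similar to its own related view and grows when the pairs are mismatched, which is the matching behaviour the contrastive setup relies on.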
Artificial Intelligence (AI) is changing the world quickly as several nations and international organizations have adopted frameworks to direct the development, application, and governance of AI. Numerous initiatives are influencing the ethical use of AI to prioritize human rights and innovation. Here are some of the top AI governance laws and frameworks. 1. EU AI…
Large Language Models (LLMs) often provide confident answers, raising concerns about their reliability, especially for factual questions. Despite widespread hallucination in LLM-generated content, no established method exists to assess response trustworthiness. Users lack a “trustworthiness score” to determine response reliability without further research or verification. The aim is for LLMs to yield predominantly high trust…
Multi-layer perceptrons (MLPs), or fully-connected feedforward neural networks, are fundamental in deep learning, serving as default models for approximating nonlinear functions. Despite their importance affirmed by the universal approximation theorem, they possess drawbacks. In applications like transformers, MLPs often monopolize parameters and lack interpretability compared to attention layers. While exploring alternatives, such as the Kolmogorov-Arnold…
Iterative preference optimization methods have shown efficacy in general instruction tuning tasks but yield limited improvements in reasoning tasks. These methods, utilizing preference optimization, enhance language model alignment with human requirements compared to sole supervised fine-tuning. Offline techniques like DPO are gaining popularity due to their simplicity and efficiency. Recent advancements advocate the iterative application…
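For readers unfamiliar with DPO, the per-example loss can be sketched in a few lines. This is a minimal illustration of the commonly stated DPO objective for a single preference pair, not the iterative method the article goes on to discuss; the function name `dpo_loss`, its argument names, and the `beta` default are assumptions, and the inputs are taken to be summed sequence log-probabilities from the policy and a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) response pair.

    Computes -log(sigmoid(beta * margin)), where the margin measures how much
    more the policy prefers the chosen response than the reference model does.
    """
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    x = beta * margin
    # -log(sigmoid(x)) = log(1 + exp(-x)), written in a numerically stable form.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))
```

When the policy and reference agree exactly, the margin is zero and the loss equals log 2; the loss falls as the policy shifts probability toward the chosen response relative to the reference.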
This study falls within artificial intelligence (AI) and machine learning, focusing specifically on neural networks that can understand binary code. The aim is to automate reverse engineering processes by training AI to understand binaries and provide English descriptions. This is important because binaries can be challenging to comprehend due to their complexity and lack…