
Salesforce Research Introduces INDICT: A Groundbreaking Framework Enhancing the Safety and Helpfulness of AI-Generated Code Across Diverse Programming Languages

The ability to automate and assist in coding has the potential to transform software development, making it faster and more efficient. However, ensuring that these models produce code that is both helpful and secure remains a central challenge. The balance between functionality and safety is critical, especially when generated code could be exploited maliciously.

In practical applications, LLMs often encounter difficulties when dealing with ambiguous or malicious instructions. These models might generate code that inadvertently includes security vulnerabilities or facilitates harmful attacks. This issue is more than just theoretical; real-world studies have shown significant risks. For instance, research on GitHub’s Copilot revealed that about 40% of the generated programs contained vulnerabilities. Mitigating these risks is essential to harness the full potential of LLMs in coding while maintaining safety.

Current methods to mitigate these risks include fine-tuning LLMs on safety-focused datasets and implementing rule-based detectors to identify insecure code patterns. Fine-tuning helps, but it often proves insufficient against sophisticated attack prompts, and creating quality safety-related data is costly and resource-intensive, requiring experts with deep programming and cybersecurity knowledge. Rule-based systems, meanwhile, rarely cover all possible vulnerabilities, leaving gaps that can be exploited.

Researchers at Salesforce Research introduced a novel framework called INDICT. This framework is designed to enhance the safety and helpfulness of code generated by LLMs. INDICT employs a unique mechanism involving internal dialogues of critiques between two critics: one focused on safety and the other on helpfulness. This dual-critic system allows the model to receive comprehensive feedback, enabling it to refine its output iteratively. The critics are equipped with external knowledge sources, such as relevant code snippets and tools like web searches and code interpreters, to provide more informed and effective critiques.
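To make the dual-critic idea concrete, here is a minimal Python sketch of how two critics might be prompted. The `llm`, `safety_critic`, and `helpfulness_critic` functions are hypothetical placeholders standing in for calls to an underlying language model; this is an illustration of the concept, not the actual INDICT implementation.

```python
# Minimal sketch of the dual-critic idea (not the official INDICT code).
# `llm` is a hypothetical placeholder for a call to any instruction-tuned model.

def llm(prompt: str) -> str:
    # Placeholder: in a real system this would query an LLM API.
    return f"[model response to: {prompt[:50]}...]"

def safety_critic(task: str, code: str) -> str:
    # Critique focused on vulnerabilities and potential for malicious use.
    return llm(
        f"Identify security risks in this code.\nTask: {task}\nCode:\n{code}"
    )

def helpfulness_critic(task: str, code: str) -> str:
    # Critique focused on whether the code correctly solves the requested task.
    return llm(
        f"Assess whether this code fulfills the task.\nTask: {task}\nCode:\n{code}"
    )
```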

The INDICT framework operates through two main stages: preemptive and post-hoc feedback. During the preemptive stage, the safety-driven critic evaluates the potential risks of the code to be generated, while the helpfulness-driven critic ensures the code aligns with the intended task requirements; both critics can query external knowledge sources to supplement their evaluations. The post-hoc stage reviews the generated code after its execution, allowing the critics to provide additional feedback based on observed outcomes. This dual-stage approach ensures the model both anticipates potential issues and learns from execution results to improve future outputs.
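Continuing the sketch above, the loop below illustrates how the two feedback stages could be orchestrated: a preemptive round of critiques before the code is run, and a post-hoc round that folds in an observed execution result. The `execute_in_sandbox` placeholder and the revision prompts are assumptions made for illustration, not the paper's exact procedure.

```python
def execute_in_sandbox(code: str) -> str:
    # Placeholder: a real system would run the code in an isolated interpreter
    # and return its output, errors, or other observations.
    return "[execution observations]"

def indict_style_generate(task: str, rounds: int = 2) -> str:
    """Iteratively refine code with preemptive and post-hoc critiques."""
    code = llm(f"Write code for the following task:\n{task}")
    for _ in range(rounds):
        # Preemptive stage: critique the draft before execution.
        revision_prompt = (
            f"Task: {task}\nCode:\n{code}\n"
            f"Safety critique: {safety_critic(task, code)}\n"
            f"Helpfulness critique: {helpfulness_critic(task, code)}\n"
            "Revise the code to address both critiques."
        )
        code = llm(revision_prompt)

        # Post-hoc stage: execute the revised code and critique again,
        # this time with the observed outcome included.
        observed = execute_in_sandbox(code)
        post_prompt = (
            f"Task: {task}\nCode:\n{code}\nExecution result: {observed}\n"
            f"Safety critique: {safety_critic(task, code)}\n"
            f"Helpfulness critique: {helpfulness_critic(task, code)}\n"
            "Revise the code based on the execution feedback."
        )
        code = llm(post_prompt)
    return code
```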

The evaluation of INDICT involved testing on eight diverse tasks across eight programming languages using LLMs ranging from 7 billion to 70 billion parameters. The results demonstrated significant improvements in both safety and helpfulness metrics. Specifically, the framework achieved a 10% absolute improvement in code quality across all tested models. For example, in the CyberSecEval-1 benchmark, INDICT improved the safety of generated code by up to 30%, with safety measures indicating that over 90% of outputs were secure. The helpfulness metric also showed substantial gains, with INDICT-enhanced models outperforming state-of-the-art baselines by up to 70%.

INDICT’s success lies in its ability to provide detailed, context-aware critiques that guide the LLM toward better code. By integrating safety and helpfulness feedback, the framework ensures the generated code is both secure and functional, offering a more robust solution to the challenges of code generation by LLMs.

In conclusion, INDICT presents a groundbreaking framework for improving the safety and helpfulness of LLM-generated code. INDICT addresses the critical balance between functionality and security in code generation by employing a dual-critic system and leveraging external knowledge sources. The framework’s impressive performance across multiple benchmarks and programming languages highlights its potential to set new standards for responsible AI in coding.


Check out the Paper. All credit for this research goes to the researchers of this project.
