← back to Blog

OpenCV

  • Guide to Underwater Image Enhancement Using OpenCV

    Have you ever captured amazing underwater footage, only to discover that your photos were plagued by poor visibility, muted colours, and a bluish-green haze? You’re not by yourself. As depth increases, warmer colours such as red, orange, and yellow are absorbed first, leaving images looking dull and low in contrast. In this post, we are…

    Read more →

  • Omni-Attribute: Open-Vocabulary Attribute Encoder for Visual Concept Personalization

    Omni-Attribute introduces a new paradigm for fine-grained visual concept personalization, solving a long-standing problem in image generation: how to transfer only the desired attribute (identity, hairstyle, lighting, style, etc.) without leaking irrelevant visual details. Developed by researchers from Snap Inc., UC Merced, and CMU, this work proposes the first open-vocabulary image attribute encoder explicitly designed…

    Read more →

  • From Image to 3D: The Structure from Motion in OpenCV

    We capture the world with cameras that compress depth, texture, and geometry into flat pixel grids, yet our minds effortlessly reconstruct the 3D structure behind them. What if computers could do the same? Structure-from-Motion (SfM) is the technique that enables this. By analyzing how features shift across multiple images, SfM simultaneously recovers the camera motion…

    Read more →

  • GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

    GeoVista introduces a new frontier in multimodal reasoning by enabling agentic geolocalization, a dynamic process where a model inspects high-resolution images, zooms into regions of interest, retrieves web information in real time, and iteratively reasons toward pinpointing a location. Developed by researchers from Fudan University, Tencent Hunyuan, Tsinghua University, and Shanghai Innovation Institute, GeoVista addresses the long-standing…

    Read more →

  • BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

    BlockVid represents a major leap forward in long-video generation, tackling one of the hardest open problems in video generation, i.e, producing coherent, high-fidelity, minute-long clips without collapse, drift, or degradation over time. Developed by DAMO Academy, ZIP Lab, and Hupan Lab, BlockVid enhances the semi-autoregressive block diffusion paradigm with innovations that directly address KV-cache error…

    Read more →

  • OpenCV HighGUI: Building Responsive and Customizable Interfaces

    Graphical User Interfaces (GUIs) are essential for interactive computer vision, enabling developers to visualize results, adjust parameters, and interact with applications in real time. While frameworks like PyQt and Tkinter are powerful, OpenCV’s HighGUI module offers a lightweight, cross-platform solution that integrates seamlessly with OpenCV, making it ideal for quick experiments, prototyping, and debugging. HighGUI…

    Read more →

  • Configuring Raspberry Pi for OpenCV: Camera & Cooling Workflow (2025 Refresh)

    Computer vision has evolved into one of the most approachable fields for anyone interested in practical AI, whether you’re a student, engineer, hobbyist, or maker. With the performance of the Raspberry Pi and the maturity of OpenCV, building reliable vision applications at home has become remarkably straightforward. This guide provides a complete walkthrough for getting…

    Read more →

  • OpenCV.js in Action: Live Webcam Filters and Effects

    OpenCV.js enables real-time webcam filters in the browser, allowing advanced computer vision effects without installations or native dependencies. From face blurring to artistic effects, it offers powerful visual processing. Powered by WebAssembly, OpenCV.js delivers near-native performance for smooth, complex transformations. This blog covers building live webcam filters with OpenCV.js, from setup to advanced effects, all…

    Read more →

  • OpenCV DNN : Bridging Classic Vision and Modern Deep Learning

    With all the buzz surrounding AI recently, OpenCV has been quietly evolving, adding a range of powerful new features. The OpenCV DNN module, in particular, has matured beautifully, aging like fine wine. As of November 2025, we can see several exciting additions in the latest release. But does it still deliver the same impact as…

    Read more →

  • WorldGrow: Generating Infinite 3D Worlds

    WorldGrow redefines 3D world generation by enabling infinite, continuous 3D scene creation through a hierarchical block-wise synthesis and inpainting pipeline. Developed by researchers from Shanghai Jiao Tong University, Huawei Inc., and Huazhong University of Science and Technology, it achieves unbounded, photorealistic, and geometrically coherent environments paving the way for scalable virtual world modeling for games,…

    Read more →