← back to Blog

CV

  • Edge Detection Using OpenCV

    Edge detection is a crucial technique in image processing and computer vision, used to identify sharp changes in brightness that typically signify object boundaries, edges, lines, or textures. It enables applications like object recognition, image segmentation, and tracking by highlighting the structural features of an image. In this blog, we’ll explore three of the most…

    Read more →

  • Image Rotation and Translation using OpenCV

    Image translation and rotation are two of the most common and essential operations in computer vision. Whether you’re aligning scanned documents, augmenting datasets for deep learning, correcting skewed camera input, or building a panorama stitching pipeline — these geometric transformations are at the core of modern image processing. In this tutorial, we’ll take a deep…

    Read more →

  • MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion

    MP-SfM redefines classical Structure-from-Motion by tightly integrating monocular depth and surface normal priors into incremental SfM, enabling robust 3D reconstruction from sparse, unstructured image collections. Key Highlights: Monocular Depth + Surface Normal Fusion – Augments traditional SfM with priors from off-the-shelf deep networks (e.g., Metric3D-v2, DSINE), eliminating the need for three-view overlap. Two-View Track Reconstruction – Enables…

    Read more →

  • Image Filtering using Convolution in OpenCV

    Image filtering is one of the fundamental building blocks of computer vision. Whether you’re smoothing an image to remove noise or enhancing features for better detection, filtering techniques based on convolution are everywhere. In this article, we’ll explore image filtering using convolution — understanding the mathematics behind it, and seeing how it’s practically implemented in…

    Read more →

  • Deepfake Detection with Computer Vision

    Imagine this! A video of a world leader giving a speech they never actually delivered, or a celebrity appearing to endorse a product they’ve never even heard of. These aren’t simple photo edits—deepfakes are hyper-realistic, AI-generated videos, images, or audio clips that manipulate reality in ways previously reserved for science fiction. For example, a recent deepfake…

    Read more →

  • Color spaces in OpenCV

    Color spaces are fundamental to how digital images are represented and processed in computer vision. While the RGB (Red, Green, Blue) is the most commonly used, OpenCV supports several other color spaces like HSV (Hue, Saturation, Value), LAB (Lightness, A, B), and YCrCb (Luminance, Chroma Red, Chroma Blue), each offering unique advantages for different image…

    Read more →

  • NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

    NormalCrafter introduces a novel approach for surface normal estimation in videos, leveraging diffusion priors to achieve high spatial fidelity and temporal consistency over arbitrary-length sequences. Key Highlights: Video Diffusion Model Repurposing – Adapts Stable Video Diffusion (SVD) for normal map prediction, maintaining temporal structure instead of RGB generation. Semantic Feature Regularization (SFR) – Aligns intermediate…

    Read more →

  • Image Thresholding using OpenCV

    Image thresholding is one of the most essential and widely used techniques in image processing and computer vision. It transforms a grayscale image into a binary image by setting pixel values to either a maximum or minimum based on a defined threshold. This simple yet powerful method is commonly used in applications such as object…

    Read more →

  • OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors

    OpenLiDARMap presents a GNSS-free mapping framework that combines sparse public map priors with LiDAR data through scan-to-map and scan-to-scan alignment. This approach achieves georeferenced and drift-free point cloud maps. Key Highlights Dual ICP-Based Matching Strategy – Integrates scan-to-scan and scan-to-map ICP matching using robust kernels (Tukey & Cauchy) for drift mitigation and local consistency across LiDAR…

    Read more →

  • Best Books to Master Computer Vision and Deep Learning

    Computer Vision and Deep Learning are the superstars of today’s AI universe, fueling everything from cars that drive themselves to medical tools smart enough to spot issues even seasoned doctors might miss. But let’s face it: navigating the ocean of online tutorials, endless research papers, and the latest “must-try” frameworks can make even the most…

    Read more →