Image thresholding is one of the most essential and widely used techniques in image processing and computer vision. It transforms a grayscale image into a binary image by setting pixel values to either a maximum or minimum based on a defined threshold. This simple yet powerful method is commonly used in applications such as object…
OpenLiDARMap presents a GNSS-free mapping framework that combines sparse public map priors with LiDAR data through scan-to-map and scan-to-scan alignment. This approach achieves georeferenced and drift-free point cloud maps. Key Highlights Dual ICP-Based Matching Strategy – Integrates scan-to-scan and scan-to-map ICP matching using robust kernels (Tukey & Cauchy) for drift mitigation and local consistency across LiDAR…
Computer Vision and Deep Learning are the superstars of today’s AI universe, fueling everything from cars that drive themselves to medical tools smart enough to spot issues even seasoned doctors might miss. But let’s face it: navigating the ocean of online tutorials, endless research papers, and the latest “must-try” frameworks can make even the most…
Image annotation is a crucial step in computer vision that involves adding meaningful information—such as shapes, labels, or markers—to an image. This process is widely used in applications like object detection, image labeling, dataset preparation, and visual storytelling. Whether it’s drawing a bounding box around a face or labeling specific objects in a scene, annotation…
Computer vision is one of artificial intelligence’s most dynamic and rapidly advancing areas, enabling machines to interpret and understand the visual world. From self-driving cars that detect and avoid pedestrians to smartphone apps that instantly translate text, the power of computer vision drives countless everyday technologies. In this blog, we’ll explore five practical and impactful…
MedSAM2 introduces a robust foundation model for promptable segmentation in 3D medical images and temporal video data, built by fine-tuning SAM2.1 on a large-scale curated medical dataset. Key Highlights: 3D & Video Segmentation Foundation Model – Tailors SAM2.1-Tiny for medical domains, supporting volumetric scans (CT, MRI, PET) and sequential video modalities (ultrasound, endoscopy) with a…
Fingerprint matching plays a crucial role in various security applications, such as identity verification and criminal investigations. While most fingerprint matching systems rely on large machine learning models and sophisticated algorithms, it is also possible to perform this task with simpler, more accessible techniques. In this article, we’ll explore how basic OpenCV feature extraction operations,…
The OpenCV community is set to gather for an exceptional event, The OpenCV-SID Conference on Computer Vision & AI (OSCCA). Our conference is one full day, and will occur on May 12th, 2025. OSCCA is hosted in conjunction with Display Week, the world’s largest technical symposium and exhibition, running from May 11th to 15th in…
GenZ-ICP introduces an innovative iterative Closest Point (ICP) method that enhances LiDAR-based pose estimation by adaptively integrating point-to-plane and point-to-point error metrics, ensuring robust performance across diverse and degenerative environments. Key Highlights Adaptive Error Metric Integration – Combines point-to-plane and point-to-point error metrics, leveraging their complementary strengths for improved pose estimation accuracy. Planarity-Based Correspondence Classification…
AirSLAM introduces a hybrid visual SLAM approach that integrates deep learning for feature detection with traditional backend optimization. Key highlights: Unified Feature Extraction: Employs a convolutional neural network (CNN) to simultaneously extract keypoints and structural lines, enhancing feature richness. Coupled Feature Optimization: Associates, matches, triangulates, and optimizes point and line features in a unified framework,…