Hough Transform - Parallel Implementation
Parallel implementation of the Hough transform for line and circle detection in images. Offers two implementations: multi-threaded (C++11 threads) for shared-memory systems and MPI-based for distributed clusters. Both preprocessing (grayscale conversion, Canny edge detection) and the Hough transform itself are parallelized. Achieves 5–7× speedup on 8 cores (threaded) and 3–4× speedup on 4 nodes (MPI). Built with C++17, OpenCV, and CMake. Generates edge maps, annotated results, and performance metrics.