Leopoldo López Reverón

Machine Learning Engineer — Computer Vision & Applied AI

About me

I am a Machine Learning Engineer specializing in Computer Vision and Audio Intelligence, with experience building end-to-end systems that span dataset preparation, optimized inference, and working demos. I work with modern vision architectures (YOLO, U-Net, DPT, OCR pipelines) and spectrogram-based models for audio. I design and run ablation studies, model comparisons, advanced metric analysis (F1 macro/micro, mAP, precision/recall, logits, per-class probabilities), and visualizations that help explain a model's internal behavior. I am particularly interested in building reproducible pipelines, optimizing models for production, and integrating explainability techniques. My approach is practical, results-oriented, and grounded in a deep understanding of each stage of the model lifecycle.

Featured projects

Depth Estimation with DPT (3D Perception)
Monocular depth estimation in coastal scenes
Original image, heatmap, depth estimation GIF
Monocular depth estimation of the Moon
Original image, heatmap, depth estimation GIF
Sub-project: 360° Panorama with PTZ camera + Depth Map

Generating a 360° panorama from a PTZ camera and estimating the depth of the entire scene

Geofencing using Photogrammetry
Interactive visualization of the KML file
Original image, geofence mask, geofence result, geofence zoom
Segmentation of Cancer Cells
Generation of malignant zone masks

Biomedical segmentation project that detects and delineates cancerous regions in microscopic images. The model generates precise masks that support analysis of the extent and morphology of malignant areas.

The images include: original sample and generated mask

PPE Detection System
Real-time PPE compliance
  • InternImage‑L, ViTPose and YOLOv7 are used for the detection and analysis of PPE
  • Optimization with TensorRT and deployment in production environments
YOLOv7 InternImage‑L ViTPose TensorRT
Visualization of the PPE Detection system

Examples of the Personal Protective Equipment (PPE) detection system in different scenarios

The images show detection of helmets, vests and other safety equipment, with bounding boxes and real-time classification

End‑to‑End Pipeline

The complete system includes all phases of the model lifecycle:

  • Dataset: Collection, cleaning, and annotation of PPE images
  • Training: YOLOv7 model with augmentations and validation
  • Inference: Optimized script for real-time prediction

Ablation Studies

Model Comparisons

Visual comparisons of models

Final Results

Maritime OCR + AIS Integration
OCR system visual pipeline
Original image, processed OCR output, OCR heatmap, OCR GIF
Satellite Super-Resolution
Before/after comparison + zoom, source: Sentinel-2
Original satellite image, super-resolved image, original zoom, super-resolved zoom

2x2 comparison showing the original image, the super-resolution version and zooms of both.

Multilabel Classification of Animal Sounds
Mel Spectrograms + CNN + Advanced Metrics

A multilabel classification system capable of identifying multiple animal species from audio. Each clip is transformed into a Mel spectrogram and processed by a convolutional neural network. The model outputs per-class logits and probabilities, which are evaluated with metrics such as F1, mAP, and macro/micro precision and recall.
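As a rough illustration of how micro and macro F1 differ in a multilabel setting, the sketch below turns logits into per-class probabilities with a sigmoid, thresholds them at 0.5, and computes both averages with NumPy. The function names, the toy logits and labels, and the 0.5 threshold are all assumptions made for this example; they are not taken from the project itself.

```python
import numpy as np

def sigmoid(logits):
    """Map raw logits to independent per-class probabilities."""
    return 1.0 / (1.0 + np.exp(-logits))

def multilabel_f1(y_true, y_pred):
    """Return (micro_f1, macro_f1) for 0/1 matrices of shape (clips, classes)."""
    tp = np.sum((y_true == 1) & (y_pred == 1), axis=0).astype(float)
    fp = np.sum((y_true == 0) & (y_pred == 1), axis=0).astype(float)
    fn = np.sum((y_true == 1) & (y_pred == 0), axis=0).astype(float)

    # Micro: pool the counts over all classes, then compute a single F1.
    micro_den = 2 * tp.sum() + fp.sum() + fn.sum()
    micro = 2 * tp.sum() / micro_den if micro_den > 0 else 0.0

    # Macro: compute F1 per class, then take the unweighted mean.
    den = 2 * tp + fp + fn
    per_class = np.divide(2 * tp, den, out=np.zeros_like(den), where=den > 0)
    return float(micro), float(per_class.mean())

# Hypothetical toy batch: 4 clips, 3 species (values invented for illustration).
logits = np.array([
    [ 2.0, -1.0,  0.5],
    [-0.5,  1.5, -2.0],
    [ 1.0,  0.3, -0.7],
    [-1.2, -0.4, -2.2],
])
y_true = np.array([
    [1, 0, 1],
    [0, 1, 0],
    [1, 0, 0],
    [0, 1, 1],
])

probs = sigmoid(logits)               # per-class probabilities
y_pred = (probs >= 0.5).astype(int)   # assumed decision threshold of 0.5
micro_f1, macro_f1 = multilabel_f1(y_true, y_pred)
print(f"F1 micro: {micro_f1:.3f}  F1 macro: {macro_f1:.3f}")
```

Micro averaging weights every individual label decision equally, while macro averaging weights every class equally, so the two diverge when some species are rarer or harder to detect than others.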

Mel spectrogram
Audio frequency
Precision micro
Precision macro
Recall micro
Recall macro
F1 micro
F1 macro
Training loss
Validation loss
Class probability
Class probability histogram
Logits
Logits histogram
RAG for Document Management
Retrieval-Augmented Generation
  • Architecture with chunking, embeddings, and semantic retrieval
  • Custom libraries for preprocessing and integration with conversational systems.
Python ChromaDB HuggingFace
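
The chunking, embedding, and semantic retrieval steps listed above can be sketched in a few lines. This is a minimal, self-contained illustration, not the project's code: a bag-of-words vector stands in for a learned embedding model (such as one loaded through HuggingFace), and an in-memory NumPy matrix stands in for a vector store such as ChromaDB. The function names, chunk sizes, and sample text are hypothetical.

```python
import numpy as np

def chunk_text(text, size=8, overlap=2):
    """Split text into overlapping windows of `size` words."""
    words = text.split()
    step = size - overlap
    stop = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, stop, step)]

def build_vocab(texts):
    """Assign an index to every distinct lowercase token."""
    vocab = {}
    for text in texts:
        for word in text.lower().split():
            vocab.setdefault(word, len(vocab))
    return vocab

def embed(text, vocab):
    """Toy embedding: L2-normalized bag-of-words counts (stand-in for a real model)."""
    vec = np.zeros(len(vocab))
    for word in text.lower().split():
        if word in vocab:
            vec[vocab[word]] += 1.0
    norm = np.linalg.norm(vec)
    return vec / norm if norm > 0 else vec

def retrieve(query, chunks, vocab, k=1):
    """Return the k chunks with the highest cosine similarity to the query."""
    matrix = np.stack([embed(c, vocab) for c in chunks])
    scores = matrix @ embed(query, vocab)   # cosine similarity (unit vectors)
    top = np.argsort(scores)[::-1][:k]
    return [(chunks[i], float(scores[i])) for i in top]

# Hypothetical document text, invented for illustration.
document = (
    "Invoices are archived after ninety days. "
    "Contracts require two signatures before approval. "
    "Backups run nightly on the storage cluster."
)
chunks = chunk_text(document, size=8, overlap=2)
vocab = build_vocab(chunks)
results = retrieve("invoices archived", chunks, vocab, k=1)
print(results[0])
```

In a real pipeline the retrieved chunks would be passed to a language model as context for answering the query; the overlap between chunks reduces the chance that a relevant sentence is split across a boundary.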
GitHub

Tech stack

Python PyTorch TensorFlow HuggingFace OpenCV FFMPEG YOLO InternImage ViTPose DPT TensorRT ONNX Kalman Filter EasyTracker FastAPI Docker AWS CI/CD PostgreSQL MongoDB ChromaDB RAG OCR Geospatial AI ETL Semantic Segmentation

Publications

Contact