Karam Mahfod

AI & Computer Vision Engineer | Video Coding & Compression Specialist

Saint Petersburg, RU.

About

Highly accomplished AI & Computer Vision Engineer specializing in video coding, compression systems, and real-time AI pipelines for large-scale, production-grade applications. Proven ability to bridge advanced academic research with industry innovation, driving significant advancements in performance, efficiency, and perceptual quality across complex systems. Seeking to leverage deep expertise in machine learning, signal processing, and high-performance computing to solve challenging problems in target roles.

Work

Cradle Vision

AI & Computer Vision Engineer

Remote

Jun 2025

→

Dec 2025

Summary

Developed and deployed real-time AI pipelines for large-scale, production-grade applications, focusing on computer vision and machine learning solutions.

Highlights

Engineered and deployed a real-time AI pipeline for attendance tracking, capable of simultaneously processing over 300 4K camera streams, significantly reducing GPU load through ROI-based detection while maintaining high recognition accuracy.

Developed a multi-model pipeline leveraging pose estimation and action classification to recognize various student behaviors and analyze teacher-student interaction patterns in real-time, providing actionable insights for educational improvement.

Spearheaded platform expansion initiatives, integrating voice-based session summaries and advanced multi-camera tracking, which enhanced inference performance for large-scale deployments.

ITMO University

Research Engineer - AI-Based Image Compression (JPEG-AI)

St. Petersburg, Saint Petersburg, Russian Federation

Dec 2024

→

Dec 2025

Summary

Led research and development efforts for next-generation neural image compression methods, contributing to the JPEG-AI standardization.

Highlights

Led a team of four engineers in the research and development of next-generation neural image compression methods, contributing directly to the JPEG-AI standardization effort.

Developed a novel perceptual quality metric, trained on Mean Opinion Score (MOS) and validated against subjective human evaluations, achieving superior correlation with human perception compared to traditional metrics (VMAF, MS-SSIM, PSNR).

Delivered significant gains in compression efficiency without perceptual quality loss, achieving 25% bitrate reduction for the 0.075 bpp model and 31% for the 0.012 bpp model compared to baseline methods.

Optimized fixed-quality and fixed-bitrate training paradigms, balancing efficiency and visual fidelity across diverse model sizes for enhanced image compression.

Vishare Technology

Software Engineer — H.265/HEVC Optimization

Remote, Hong Kong, Hong Kong

Jun 2021

→

Feb 2025

Summary

Contributed to the design, optimization, and enhancement of H.265/HEVC video codecs, improving performance and efficiency.

Highlights

Optimized H.265/HEVC video codecs, enhancing compression efficiency and runtime performance across multiple platforms by redesigning encoder/decoder modules and optimizing rate-distortion, motion estimation, and entropy coding.

Implemented advanced multithreading strategies and low-level optimizations, boosting throughput and scalability, and significantly improving pipeline concurrency and memory utilization in real-time and batch processing.

Developed a feature-rich 32-bit RISC-V simulator, incorporating multithreading, MESI cache coherence, and branch prediction for hardware-software co-design and codec acceleration.

Created custom profiling and debugging utilities utilizing GProf, Valgrind, and low-level instrumentation to pinpoint bottlenecks and validate system-level performance improvements.

Contributed to CI/CD workflows, cross-platform builds, and automated testing pipelines, leveraging containerized environments and continuous benchmarking for robust video compression solutions.

ITMO University

Research Engineer — H.265/HEVC-Based Transcoding System

St. Petersburg, Saint Petersburg, Russian Federation

Dec 2023

→

Dec 2024

Summary

Designed and implemented an H.265/HEVC-based transcoding system for efficient video processing workflows.

Highlights

Designed and implemented a comprehensive transcoding pipeline utilizing OpenHEVC and x265, enabling efficient video re-encoding and format conversion for large-scale video processing workflows.

Integrated SUR-based quantization and HVS-inspired pre-filtering to optimize quantization decisions, significantly improving both compression performance and visual quality.

Re-architected decision logic to shift computationally expensive processes from decoder to encoder, significantly reducing overall encoding complexity and runtime.

Achieved equal or superior compression efficiency compared to H.266 for the enhanced H.265/HEVC pipeline across multiple video datasets, validated through BD-Rate analysis.

Syriatel

C++ Developer - Network and Backend Systems

Damascus, Damascus, Syrian Arab Republic

Mar 2020

→

May 2021

Summary

Developed backend C++ modules for telecom network management and gained foundational experience in large-scale systems.

Highlights

Developed robust backend C++ modules for real-time telecom network management systems, ensuring high availability and performance.

Collaborated effectively with cross-functional teams to deliver new features and resolve complex issues in a fast-paced development environment.

Gained foundational experience in large-scale system development and maintenance, contributing to critical infrastructure projects post-graduation.

Education

ITMO University

St. Petersburg, Saint Petersburg, Russian Federation

Sep 2023

→

Dec 2025

Ph.D.

Distributed Video Coding and Transmission using Machine Learning

ITMO University

St. Petersburg, Saint Petersburg, Russian Federation

Sep 2021

→

Jun 2023

M.Sc.

Network and Cloud Computing

Grade: with honors

Higher Institute for Applied Sciences and Technology (HIAST)

Damascus, Damascus, Syrian Arab Republic

Sep 2015

→

Aug 2020

B.Sc.

Computer Science

Grade: 4.7 GPA

Publications

Multi-Hypothesis based Distributed Video Coding using Error-Correction Decoder Feedback

Jan 2025

Published by

DSPA

Summary

Exploration of multi-hypothesis distributed video coding with error-correction decoder feedback.

Inter-frame and Inter-view Quality Enhancement in Distributed Multi-view Video Coding

Jan 2025

Published by

International Symposium on Problems of Redundancy in Information and Control Systems

Summary

Submitted research on quality enhancement in distributed multi-view video coding.

H.265/HEVC Decoding via Iterative Recovery from Incomplete Quantized Measurements

Jan 2025

Published by

IEEE signal processing letter

Summary

Submitted research on H.265/HEVC decoding using iterative recovery techniques.

Q-Learning Based Adaptive Multipath Routing Algorithm for Data Centre Networks

Jan 2024

Published by

RusAutoCon

Summary

Research on an adaptive multipath routing algorithm using Q-Learning for data center networks.

Languages

Arabic

English

Russian

Skills

Programming Languages

C/C++, Python, Bash, MATLAB, Verilog.

Video & Image Processing

FFmpeg, GStreamer, DeepStream, OpenHEVC, x265, OpenCV, JPEG-AI, H.265/HEVC, Video Coding, Compression Systems.

Machine Learning & AI

PyTorch, TensorFlow, TensorRT, ONNX, Triton Inference Server, Computer Vision, Pose Estimation, Action Classification, Neural Networks, AI Pipelines.

System & Performance Engineering

Multithreading (OpenMP, Pthreads), SIMD (AVX2/AVX-512), GProf, Valgrind, GDB, Performance Profiling, System Optimization, Real-Time Systems, Large-Scale Deployments, Distributed Systems.

Development & Operations

CMake, Make, Docker, Git, CI/CD, Cross-platform Builds, Automated Testing, Containerized Environments.

Tools & Libraries

NumPy, Pandas, LaTeX, Visual Studio, VS Code, PyCharm.

Operating Systems

Linux, Windows.

Research & Analysis

Algorithmic Problem-Solving, Mathematical Foundations (Probability Theory, Number Theory, Numerical Methods), Signal Processing, Experiment Design, Algorithmic Performance Evaluation, Statistical Analysis, Visual Analysis, Mean Opinion Score (MOS), BD-Rate Analysis.

Leadership & Collaboration

Project Leadership, Team Coordination, Cross-functional Collaboration, Technical Presentation, Documentation, Teamwork.