
ALEKSANDR NAGAEV
LinkedIn | GitHub | Telegram | Website
PROFILE
ML R&D Team Lead with 5+ years of expertise in NLP and Computer Vision, dedicated to advancing state-of-the-art AI
through novel algorithms and architectures. Proven ability to lead research projects from conception to publication and deliver
innovative solutions for complex applied problems. Strong background in both theoretical research and practical, optimized
deployment.
EXPERIENCE
Team Lead, RnD Vision June 2020 – Present
SberDevices Moscow, Russia
Leading a team of 9 researchers and engineers in cutting-edge computer vision and multimodal AI development
• Multimodal LLM Project: Integrated the video modality into the GigaChat multimodal architecture, resulting in a 5-10%
performance improvement across all benchmarks, including image-centric tasks.
• RSL Translation System: Created the world’s first real-time Russian Sign Language to text translator with a BLEU-4
score of 64, enabling seamless communication for the deaf community.
• SLOVO Framework: Led the creation of the world’s first open-source framework for isolated sign language recognition.
Achieved SOTA results: 65.3% accuracy on WLASL (American SL) and 87.3% on the proprietary RSL benchmark. Research
published at ICVS’23 and presented at a CVPR’23 workshop.
• Real-time Talking Head Assistant: Architected an end-to-end system for a video conferencing chatbot, integrating a
spotter, ASR-API, RuGPT-3, a proprietary text2landmark model for lip-sync, and a head animation system.
• Dynamic Gesture Recognition: Designed an algorithm for device control using heuristics and hand detection models,
enabling precise gesture-based interaction.
• Facial Enhancement System: Developed a real-time 3D morphing and skin smoothing pipeline achieving 30 FPS on
mid-tier CPUs using SSD face detection and OpenCV optimizations.
TECHNICAL SKILLS
Languages & Frameworks Python, PyTorch, OpenCV, Pandas, Hugging Face
ML Domains NLP: Transformers, LLMs, MultiModal LLMs
Computer Vision: Image/Video Processing, Sign Language Recognition, Gesture Recognition
Deployment & Tools Docker, Git, CI/CD, ONNX, CPU/Edge Optimization
Languages Russian (Native), English (Intermediate - B1)
PUBLICATIONS AND PRESENTATIONS
Research Profiles
h-index: 5
• Google Scholar
• Orcid
• Scopus
Articles
• Habr
Technical Talks
• Saint HighLoad++ 2024
• GIGA RnD Day 2024
• HighLoad++ 2023
• X5 Data science meetup 2023
EDUCATION
Master of Data Science, Ural Federal University 2019 – 2022
Instructed undergraduate students in Machine Learning and Python programming.
Bachelor of Computer Science, Ural Federal University 2015 – 2019
Led cross-functional teams in semester-long academic projects using Agile methodology.