Taneem Ullah Jan
I am working and contributing to the advancement in the field of 3D neural head avatars and virtual humans using computer graphics, computer vision, and deep neural networks. I serve as an AI Researcher at VOLV AI, where I direct the machine learning team in exploring innovative solutions in 3D computer vision for virtual try-on and digital human synthesis. Previously, I worked as a Research AI Engineer at BHuman AI, leading the research and development for their core generative AI products.
connect and contact
research and projects
-
OmniFit-3D: A Unified Framework for 3D Virtual Try-On
with Pose-Adaptive Realism
A unified framework for 3D virtual try-on that transforms simple 2D images into realistic 3D representations, by efficiently integrating clothing with the human body in a pose-adaptive manner.
-
LipSyncFace: High-Fidelity Audio-Driven and
Lip-Synchronized Talking Face Generation
A two-stage unified audio-driven talking face generation framework, which can render high-fidelity, lip-synchronized videos with improved inference speed.
-
Beyond CNNs: Encoded Context for Image Inpainting with
LSTMs and Pixel CNNs
Current image inpainting techniques are too heavy; this paper introduces a Row-wise Flat Pixel LSTM, a small hybrid model for the efficient and high-quality restoration of small images.
-
lipsync2: Talking Face Generation with Most Accurate Lip
Synchronization
A robust and efficient talking face generation model with highly accurate lip synchronization and full facial expressiveness with more extended audio and high-quality video resolutions.
-
face2face: One-Shot Talking Head Video Generation from a
Static Source Image
An unsupervised one-shot talking head video generation model using neural rendering and motion transfer techniques with non-linear transformation to animate static images.
-
face-swapping: Swapping Faces on Target from given Sources
Innovative face-swapping model that preserves the source identity features accurately while seamlessly adapting target attributes applicable to images and videos.
-
HTML Code Generation from Images with Deep Neural
Networks
An accurate deep learning model converting GUI mockups into HTML code, streamlining web development for non-programmers.