Xuyang CaoAlgorithm Engineer, Ph.D.
JD.com, Beijing, China
|
![]() |
Biography
Dr. Xuyang Cao is an algorithm engineer at JD Health Inc. in JD.com, specializing in AIGC and multimodal research. Over the past year, his primary focuses has been on multimodal deep learning algorithms, with an emphasis on audio-driven low-cost talking face generation technology exploration and application.
Since 2016, He has been engaged in research in computer vision, image processing, and pattern recognition. He completed his undergraduate, master’s, and Ph.D. degrees at Beijing Jiaotong University. During his Ph.D., his research focused on semantic segmentation and semi-supervised learning, under the supervision of Professors Houjin Chen and Yanfen Li. Additionally, He closely collaborated with ProfessorYahui Peng during his doctoral studies.
Education
Ph.D, School of Electronic and Information Engineering, Beijing Jiaotong University | 2017-2022 |
Master Candidate, School of Electronic and Information Engineering, Beijing Jiaotong University | 2016-2017 |
Bachelor of Engineering, School of Electronic and Information Engineering, Beijing Jiaotong University | 2012-2016 |
Projects
Audio Driven Talking Face + LLM Technology and Application Innovation | 2024.01-2025.01 | |
|
||
The Brain Science Project - Brain-Computer Interface (BCI) - Brain Disease Diagnosis Based on Medical Imaging | 2023.02-2024.01 | |
|
||
Research on Deep Learning based Breast Mass (Ultrasound) and Lung Mass (MRI) Segmentation | 2018.09-2022.04 | |
|
||
High Speed Train Gear Defect Detection Based on Computer Vision | 2018.09-2022.04 | |
|
||
Geometric Parameters Measurement of an Overhead Line System | 2018.09-2022.04 | |
|
Publications
G. Wang, X. Cao, S. An, F. Fan, C. Zhang, J. Wang, F. Yu, Z. Wang. Multi-Dimension-Embedding-Aware Modality Fusion Transformer for Psychiatric Disorder Classification, ICIGP, 2025. [paper] | |
X. Cao, G Wang, S Shi, J Zhao, Y Yao, J Fei, M Gao. JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation. Arxiv, 2024. [paper] [code][project] | |
S. Shi, X. Cao, J. Zhao, G. Wang. JoyHallo: Digital human model for Mandarin. Arxiv, 2024. [technical report] [code][project] | |
Z. Gao, Y. Guo, G. Wang, X. Chen, X. Cao, C. Zhang, S. An, F. Xu. Robust deep learning from incomplete annotation for accurate lung nodule detection. Computers in Biology and Medicine, 2024, 173:108361. [paper] | |
X. Cao, H. Chen, Y. Li, Y. Peng, Y. Zhou, L. Cheng, T. Liu, D. Shen. Auto-DenseUnet: Searchable Neural Network Architecture for Tumor Segmentation in 3D Automated Breast Ultrasound. Medical Image Analysis, 2022, 82: 102589. [paper] | |
Y. Zhou, H. Chen, Y. Li, X. Cao, S. Wang, D. Shen. Cross-Model Attention-Guided Tumor Segmentation for 3D Automated Breast Ultrasound (ABUS) Images. IEEE Journal of Biomedical and Health Informatics, 2022, 26(1): 301-311. [paper] | |
X. Cao, H. Chen, Y. Li, Y. Peng, S. Wang, L. Cheng. Uncertainty Aware Temporal-Ensembling Model for Semi-supervised ABUS Mass Segmentation. IEEE Transactions on Medical Imaging, 2021, 40(1):431-443. [paper] | |
X. Cao, H. Chen, Y. Li, Y. Peng, S. Wang, L. Cheng. Dilated Densely Connected U-Net with Uncertainty Focus Loss for 3D ABUS Mass Segmentation. Computer Methods and Programs in Biomedicine, 2021, 209: 106313. [paper] | |
J. Li, H. Chen, Y. Li, Y. Peng, N. Cai, X. Cao. AMRSegNet: Adaptive Modality Recalibration Network for Lung Tumor Segmentation on Multi-Modal MR Images. Multimedia Tools and Applications, 2021, 80: 33779–33797. [paper] | |
X. Cao, H. Chen, Y. Li, Y. Peng, Y. Zhou, L. Cheng. Boundary Loss with Non-Euclidean Distance Constraint for ABUS Mass Segmentation. 2020 CISP-BMEI, Chengdu, China, 2020, pp: 645-650. [paper] | |
Y. Peng, X. Cao, H. Chen, Y. Li, J. Li, X. Wang. Preliminary Study on Noise and Artifact Reduction in Phase-Contrast CT Image of Tristructural-Isotropic Coated Fuel Particle (in Chinese). Acta Electronica Sinica, 2019, 47(2): 448-453. [paper] | |
C. Wang, F. Li, Y. Li, H. Chen and X. Cao. A Defect Status Detecting Method for External Gear in Railway. 2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC), Chongqing, 2018, pp: 123-127. [paper] | |
Y. Li, X. Cao, H. Chen, L. Zhang, N. Yang. Defect Status Detection Method Based on Machine Vision for External Gear in Train (in Chinese). Journal of The China Railway Society, 2018, 40(12):33-41. [paper] | |
J. Wei, X. Cao, H. chen, Y. li. Research on benign and malignant masses classification in mammogram (in Chinese). Journal of Beijing Jiaotong University, 2017, 41(5): 73-. [paper] |
Patents&Translated Books
Y. Peng, W. Jiang, Z. Zhu, H. Yang, X. Cao, H. Chen. A method of Measuring the Geometric Parameters of an Overhead Line System by using Geometric Magnification and Monocular Vision. China, CN201810182553.1, 2018-11-13. [Link] |
Y. Peng, C. Zhang, B. Zheng, J. Yin, X. Cao, H. Chen. A method and a Device for Measuring the Geometric Parameters of an Overhead Line System by using Scale Factors and Frame Differences. China, CN201710464403.5, 2017-06-19. [Link] |
Y. Zhou, X. Cao. Neural Networks with TensorFlow 2, Apress, 2020. [translated] [Link] |
Personal Qualifications
Research Interests: | AIGC, Multimodal, Image Processing, Pattern Recognition, Computer Vision, Artificial Intelligence | |
Language Skills: | Chinese (Monther Tongue) | English (Proficient in Listening, Speaking, Reading, Writing), IELTS Score: 7 | |
Computer Skills: | Programming Languages: Python, C++ | Computer Vision: Pytorch, OpenCV | Others: Linux, Vim, LaTex | |
Hobbies: | Personal blog with over 330,000 views, CSDN Blog Expert | Enjoys reading, running, and traveling |