Posts by Collection

Recommended citation: Zheng, Y., Huang, Y. K., Tao, R., Shen, Z., & Savvides, M. (2021). Unsupervised disentanglement of linear-encoded facial semantics. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 3917-3926). https://arxiv.org/abs/2103.16605

Query-Centric Trajectory Prediction

Published in CVPR, 2023

Recommended citation: Zhou, Z., Wang, J., Li, Y. H., & Huang, Y. K. (2023). Query-centric trajectory prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 17863-17873). https://openaccess.thecvf.com/content/CVPR2023/papers/Zhou_Query-Centric_Trajectory_Prediction_CVPR_2023_paper.pdf

Enhanced Training of Query-Based Object Detection via Selective Query Recollection

Published in CVPR, 2023

Recommended citation: Chen, F., Zhang, H., Hu, K., Huang, Y. K., Zhu, C., & Savvides, M. (2023). Enhanced training of query-based object detection via selective query recollection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 23756-23765). https://arxiv.org/abs/2212.07593

QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory Prediction

Published in arXiv, 2023

Recommended citation: Zhou, Z., Wen, Z., Wang, J., Li, Y. H., & Huang, Y. K. (2023). Qcnext: A next-generation framework for joint multi-agent trajectory prediction. arXiv preprint arXiv:2306.10508. https://arxiv.org/abs/2306.10508

TOFG: Temporal Occupancy Flow Graph for Prediction and Planning in Autonomous Driving

Published in IEEE Transactions on Intelligent Vehicles (TIV), 2023

Recommended citation: Wen, Z., Zhang, Y., Chen, X., Wang, J., Li, Y. H., & Huang, Y. K. (2023). Tofg: Temporal occupancy flow graph for prediction and planning in autonomous driving. IEEE Transactions on Intelligent Vehicles. https://ieeexplore.ieee.org/document/10185140

A Reference-Based 3D Semantic-Aware Framework for Accurate Local Facial Attribute Editing

Published in IEEE International Joint Conference on Biometrics (IJCB), 2024

Recommended citation: Huang, Y. K., Zheng, Y., Su, Y. S., Bolimera, A., Zhang, H., Chen, F., & Savvides, M. (2024, September). A Reference-Based 3D Semantic-Aware Framework for Accurate Local Facial Attribute Editing. In 2024 IEEE International Joint Conference on Biometrics (IJCB) (pp. 1-10). IEEE. https://arxiv.org/abs/2407.18392

BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction

Published in NeurIPS, 2024

Recommended citation: Zhou, Z., Haibo, H.U., Chen, X., Wang, J., Guan, N., Wu, K., Li, Y.H., Huang, Y.K. and Xue, C.J., 2024. Behaviorgpt: Smart agent simulation for autonomous driving with next-patch prediction. Advances in Neural Information Processing Systems, 37, pp.79597-79617. https://arxiv.org/abs/2405.17372

ModeSeq: Taming Sparse Multimodal Motion Prediction with Sequential Mode Modeling

Published in CVPR, 2025

Recommended citation: Zhou, Z., Zhou, H., Hu, H., Wen, Z., Wang, J., Li, Y. H., & Huang, Y. K. (2025). ModeSeq: Taming Sparse Multimodal Motion Prediction with Sequential Mode Modeling. In Proceedings of the Computer Vision and Pattern Recognition Conference (pp. 1612-1621). https://arxiv.org/abs/2411.11911

VTutor: An Animated Pedagogical Agent SDK that Provide Real Time Multi-Model Feedback

Published in AIED, 2025

Recommended citation: Chen, E., Lin, C., Huang, Y. K., Tang, X., Xi, A., Lin, J., & Koedinger, K. (2025). VTutor: An Animated Pedagogical Agent SDK that Provide Real Time Multi-Model Feedback. arXiv preprint arXiv:2505.06676. https://arxiv.org/abs/2505.06676

Yu-Kai Huang

Posts by Collection

awards

patents

projects

publications