Zhe Wang

 

Email: buptwangzhe2012[at]gmail[dot]com

 

                     

About Me

I am a Senior GenAI Researcher at Adobe Firefly, working on text-to-video generation. I spent about two years in the autonomous vehicle (AV) industry at Cruise LLC, where I learned how to build AI products and grew from an individual contributor (IC) to a tech lead (TL). I obtained my Ph.D. from UC Irvine, where I was advised by Prof. Charless Fowlkes.

News

  • News! [Oct, 2024] We are hiring research interns to work on T2V/T2I/VLM in Firefly for summer 2025. Contact me directly if you are interested ( LINK ).
  • News! [Oct, 2024] Generate Video (beta) is on Firefly Web App ( BLOG ).
  • News! [Sep, 2024] We are bringing generative AI (T2V and GenExtend models) to video with the Adobe Firefly Video Model ( BLOG ).
  • [Mar, 2024] I moved to generative computer vision after working for about 10 years on CV-related perception problems.
  • [Jul, 2022] One paper on multi-view 3D human pose estimation was accepted to ECCV 2022.
  • [May, 2022] I uploaded my Ph.D. thesis and presentation; have a look if you are interested!
  • [Feb, 2022] Thank you to the academic community for the continued interest in our GPA dataset. I recently provided a meta file for the single-video setting and improved the Gaussian blur quality (oval-shaped blur) to mitigate defects of the Gaussian blur.
  • [Nov, 2021] I successfully defended my Ph.D. dissertation!

Selected Publications [ Full List ]

PPT: token-Pruned Pose Transformer for monocular and multi-view human pose estimation
H. Ma, Z. Wang, Y. Chen, D. Kong, L. Chen, X. Liu, X. Yan, H. Tang, and X. Xie
European Conference on Computer Vision (ECCV) , 2022.
[ Paper ] [ Code ]
Robust Estimation of 3D Human Body Pose with Geometric Priors
Z. Wang
Ph.D. thesis, 2021
[ Slides ] [ Paper ] [ BibTex ]
The Best of Both Worlds: Combining Model-based and Nonparametric Approaches for 3D Human Body Estimation
Z. Wang, J. Yang, and C. Fowlkes
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), ABAW workshop, 2022
[ Paper ] [ BibTex ]
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation
Z. Wang, H. Chen, X. Li, C. Liu, Y. Xiong, J. Tighe, and C. Fowlkes
IEEE Winter Conf. on Applications of Computer Vision (WACV), 2022
[ Paper ] [ BibTex ] [ Poster ] [ Slides ]
Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation
Z. Wang, D. Shin, and C. Fowlkes
European Conference on Computer Vision (ECCV), 3DPW workshop, 2020
[ Paper ] [ Project Page ] [ BibTex ] [ Arxiv ] [pre-trained GPA model]
Geometric Pose Affordance: 3D Human Pose with Scene Constraints
Z. Wang, L. Chen, S. Rathore, D. Shin, and C. Fowlkes
(Arxiv), 2019
[ Paper ] [ Project Page ] [ Dataset Video ] [Dataset & Model] [ BibTex ]
Structured Triplet Learning with POS-tag Guided Attention for Visual Question Answering
Z. Wang, X. Liu, L. Chen, L. Wang, Y. Qiao, X. Xie, and C. Fowlkes
IEEE Winter Conf. on Applications of Computer Vision (WACV), 2018
[ Paper ] [ Poster ] [ Presentation ] [ Code ] [ Talk Video ] [ BibTex ]
Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition
Z. Wang, L. Wang, Y. Wang, B. Zhang, and Y. Qiao
IEEE Transactions on Image Processing (TIP), 2017
[ Paper ] [ Extended Abstract ] [ Code & Model ] [ Feature ] [ Project Page ] [ BibTex ]
Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images
L. Wang, Z. Wang, Y. Qiao, and L. Van Gool
International Journal of Computer Vision (IJCV), 2018.
[ Paper ] [ Code & Model ] [ BibTex ]
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
L. Wang, Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. Van Gool
European Conference on Computer Vision (ECCV), 2016.
[ Paper ] [ Poster ] [ Code ] [ BibTex ]
Real-time Action Recognition with Enhanced Motion Vector CNNs
B. Zhang, L. Wang, Z. Wang, Y. Qiao, and H. Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
[ Paper ] [ Project Page ] [ BibTex ]

Academic Service

    Journal Reviewer

    TVCG, TIP, TMM, PR, IJCV, TPAMI, etc.


    Conference Reviewer

    ICML, AAAI, ICLR, NeurIPS, CVPR, ICCV, WACV, BMVC, ECCV, 3DV, SoCal NLP, ICCV workshops, CVPR workshops, etc.

TA/Readers:

ICS 6D, Discrete Math for Computer Science. ( Fall 2016 )

CS 175, Projects in AI in Minecraft. ( Winter 2017 )

CS 116, Computational Photography and Vision. ( Spring 2017 )

Invited Talks

  • Google Research (03/2024)
  • Apple (02/2024)
  • ECCV 2022 (10/2022)
  • CVPR 2022 (06/2022)
  • Electronic Arts (08/2021)
  • OPPO US Research Center (07/2021)
  • WACV 2018, Lake Tahoe (03/2018)
  • Shenzhen Institute of Advanced Technology (01/2018)
  • ICASSP, Shanghai (03/2016)
  • ICCV 2015, Santiago de Chile, Chile (12/2015)

Contests

Outstanding interns I have worked with

  • Peize Sun (HKU), 2023, Cruise LLC
  • Bhavya Goyal (UW Madison), 2022, Cruise LLC