Zhiyuan Peng

Zhiyuan Peng

Research Assistant

Santa Clara University

Biography

I am a Ph.D. candidate at Santa Clara University supervised by Prof. Yi Fang.

Interests
  • Probabilistic Graphical Model
  • Natural Language Processing
  • Information Retrieval
Education
  • Ph.D. in Computer Science and Engineering, Present

    Santa Clara University

  • MS in Computer Science and Engineering, 2020

    Santa Clara University

  • MS in Electronics and Communications Engineering, 2017

    Beijing Institute of Technology

  • BS in Electronics and Communications Engineering, 2014

    Beijing Jiaotong University

Experience

 
 
 
 
 
Data Scientist
Jun 2022 – Sep 2022 Sunnyvale, CA, USA

Responsibilities include:

  • Developed a novel entity‑aware multi‑task learning model for query understanding to reduce 75% GPU resource usage, enhance key metrics as well as development and maintenance efficiency
  • Collaborated with colleagues to do online A/B test and obtained statistical significance lifts over current production: GMV (0.51%), Order (0.65%), UNITS(1.08%), ATC (0.65%), and P99 latency (18.6%)
  • Authored a paper Entity-aware Multi-task Learning for Query Understanding at Walmart accepted by KDD'23
 
 
 
 
 
Machine Learning Engineering
Jun 2020 – Dec 2020 San Francisco, CA, USA

Responsibilities include:

  • Solved no evaluation data issue by tagging about 300 call notes with named entity recognition (NER) labels through brat rapid annotation tool
  • Built an intent classification model for call notes taken by pharmaceutical sales representatives to predict doctors’ intents and a NER model for extracting valuable information from call notes
  • Developed a suggestion recommendation demo app using Flask for pharmaceutical sales representatives and deployed it on AWS EC2

Recent & Upcoming Talks

Recent Publications

Quickly discover relevant content by filtering publications.
(2023). Semi-Supervised Named Entity Recognition with Data Augmentation by Structured Consistency Training. Available at SSRN 4295244.

PDF Cite

(2022). DIANES: A DEI Audit Toolkit for News Sources. Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval.

PDF Cite

(2021). Multi-label classification of short texts with label correlated recurrent neural networks. Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval.

PDF Cite Code

(2016). High-precision TT&C signal simulation technology based on Lagrange interpolation in high dynamic environment. 2016 13th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP).

PDF Cite

Popular Topics

Contact