🧐 About Me

Hi there! I am a first-year PhD student in Computer Science at the ETH Zurich, under the supervision of Prof. Florian Tramèr. I completed my master’s degree in Software Engineering at Zhejiang University in March 2023, advised by Prof. Chao Wu. Before that, I received my Bachelor’s degree at Hainan University in July 2020.

Research Interests: I am mainly interested in data-centric AI or data-driven machine learning, including data privacy, data security, data efficiency, and data-related applications. My research investigates how to elevate data-centric approaches to improving the performance of machine learning models. Previously, I focused on the following research topics:

  • Data Efficiency: Data-Free Knowledge Distillation, Dataset Condensation
  • Data Security: Adversarial Examples, Model Inversion
  • Data Privacy: Membership Inference Attack
  • Data-related Applications: Federated Learning, Imbalanced Learning, Continual Learning

🤔 For my PhD study, I am currently exploring the aspects of privacy and utility associated with synthetic data, particularly the use of generative models such as GANs or Diffusion models to obtain synthetic data.

🔥 News

📝 Selected Publications

(* indicates equal contribution; # indicates corresponding authorship.)

CVPR 2022
sym

Towards Efficient Data-Free Black-box Adversarial Attack
Jie Zhang*, Bo Li*, Jianghe Xu, Shuang Wu, Shouhong Ding, Chao Wu#. (CVPR 2022) code

  • In this paper, by rethinking the collaborative relationship between the generator and the substitute model, we design a novel black-box attack framework. The proposed method can efficiently imitate the target model through a small number of queries and achieve high attack success rate.
ICML 2022
sym

Federated Learning with Label Distribution Skew via Logits Calibration
Jie Zhang, Zhiqi Li, Bo Li, Jianghe Xu, Shuang Wu, Shouhong Ding, Chao Wu#. (ICML 2022)

  • In this work, we investigate the label distribution skew from a statistical view. We demonstrate both theoretically and empirically that previous methods based on softmax crossentropy are not suitable, which can result in local models heavily overfitting to minority classes and missing classes. Then, we propose FedLC (Federated learning via Logits Calibration), which calibrates the logits before softmax cross-entropy according to the probability of occurrence of each class.
CVPR 2023, highlight
sym

Accelerating Dataset Distillation via Model Augmentation
Lei Zhang*, Jie Zhang*, Bowen Lei, Subhabrata Mukherjee, Xiang Pan, Bo Zhao, Caiwen Ding, Yao Li, Dongkuan Xu. (CVPR 2023) code

  • In this paper, we assume that training the synthetic data with diverse models leads to better generalization performance. Thus we propose two model augmentation techniques, i.e., using early-stage models and weight perturbation to learn an informative synthetic set with significantly reduced training cost. Extensive experiments demonstrate that our method achieves up to 20× speedup and comparable performance on par with state-of-the-art baseline methods.
NeurIPS 2022
sym

DENSE: Data-Free One-Shot Federated Learning
Jie Zhang*, Chen Chen*, Bo Li, Lingjuan Lyu, Shuang Wu, Shouhong Ding, Chunhua Shen, Chao Wu#. (NeurIPS 2022) code

  • The paper focuses on one-shot federated learning, i.e., the server can learn a model with a single communication round. The proposed FedSyn method has two stages: first, training a generator from the ensemble of models from clients; second, distilling the knowledge of the ensemble into a global model with synthetic data. We validate the efficacy of FedSyn by conducting extensive experiments on 6 different datasets with various non-IID settings generated from Dirichlet distributions. Results can well support that the proposed method consistently outperforms all the baselines.
AAAI 2023, oral
sym

Delving into Adversarial Robustness of Federated Learning
Jie Zhang*, Bo Li*, Chen Chen, Lingjuan Lyu, Shuang Wu, Shouhong Ding, Chao Wu#. (AAAI 2023)

  • To facilitate a better understanding of the adversarial vulnerability of the existing FL methods, we conduct comprehensive robustness evaluations on various attacks and adversarial training methods. Moreover, we reveal the negative impacts induced by directly adopting adversarial training in FL, which seriously hurts the test accuracy, especially in non-IID settings. In this work, we propose a novel algorithm called Decision Boundary based Federated Adversarial Training (DBFAT), which consists of two components (local re-weighting and global regularization) to improve both accuracy and robustness of FL systems.

🎖 Honors and Awards

  • 2021.05 We won the first prize on CVPR21 Workshop (Adversarial Machine Learning in Real-World Computer Vision Systems and Online Challenges, rank: 1 / 1558).
  • 2022.10 China National Scholarship, Zhejiang University, 2022
  • Outstanding Student Scholarship, First Prize, Hainan University, 2018, 2019, 2020.

📖 Educations

  • 🎓 2020.09 - 2023.03, Master, Zhejiang University, Hangzhou.
  • 🎓 2016.09 - 2020.06, Undergraduate, Hainan University, Haikou.

💬 Services

  • Journal Reviewer:
    • IEEE Transactions on Neural Networks and Learning Systems
    • Neural Networks
  • Conference Reviewer: ICLR 2022, AAAI 2022, CVPR 2022, ICML 2022, ECCV2022.

💻 Internships

🎙 Miscellaneous

Travel

I enjoy the time traveling with my families and friends. I am always excited about visiting new places and knowing different cultures.

My cat

My girlfriend and I have three cats together, they are very adorable and have brought a lot of fun to our lives!

图片名称 图片名称 图片名称