Zhenglun Kong
2020 – today
- 2024
- [c24]Xuan Shen, Peiyan Dong, Lei Lu, Zhenglun Kong, Zhengang Li, Ming Lin, Chao Wu, Yanzhi Wang:
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge. AAAI 2024: 18944-18951
- [c23]Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang:
Rethinking Token Reduction for State Space Models. EMNLP 2024: 1686-1697
- [c22]Pu Zhao, Fei Sun, Xuan Shen, Pinrui Yu, Zhenglun Kong, Yanzhi Wang, Xue Lin:
Pruning Foundation Models for High Accuracy without Retraining. EMNLP (Findings) 2024: 9681-9694
- [c21]Zhengang Li, Alec Lu, Yanyue Xie, Zhenglun Kong, Mengshu Sun, Hao Tang, Zhong Jia Xue, Peiyan Dong, Caiwen Ding, Yanzhi Wang, Xue Lin, Zhenman Fang:
Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers. ICS 2024: 324-337
- [c20]Pinrui Yu, Dan Luo, Timothy Rupprecht, Lei Lu, Zhenglun Kong, Pu Zhao, Yanyu Li, Octavia I. Camps, Xue Lin, Yanzhi Wang:
FasterVD: On Acceleration of Video Diffusion Models. IJCAI 2024: 8838-8842
- [i25]Xuan Shen, Zhenglun Kong, Changdi Yang, Zhaoyang Han, Lei Lu, Peiyan Dong, Cheng Lyu, Chih-hsiang Li, Xuehang Guo, Zhihao Shu, Wei Niu, Miriam Leeser, Pu Zhao, Yanzhi Wang:
EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge. CoRR abs/2402.10787 (2024)
- [i24]Jun Liu, Chao Wu, Changdi Yang, Hao Tang, Haoye Dong, Zhenglun Kong, Geng Yuan, Wei Niu, Dong Huang, Yanzhi Wang:
Efficient Pruning of Large Language Model with Adaptive Estimation Fusion. CoRR abs/2403.10799 (2024)
- [i23]Zhengang Li, Alec Lu, Yanyue Xie, Zhenglun Kong, Mengshu Sun, Hao Tang, Zhong Jia Xue, Peiyan Dong, Caiwen Ding, Yanzhi Wang, Xue Lin, Zhenman Fang:
Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers. CoRR abs/2407.18175 (2024)
- [i22]Xuan Shen, Pu Zhao, Yifan Gong, Zhenglun Kong, Zheng Zhan, Yushu Wu, Ming Lin, Chao Wu, Xue Lin, Yanzhi Wang:
Search for Efficient Large Language Models. CoRR abs/2409.17372 (2024)
- [i21]Zheng Zhan, Zhenglun Kong, Yifan Gong, Yushu Wu, Zichong Meng, Hangyu Zheng, Xuan Shen, Stratis Ioannidis, Wei Niu, Pu Zhao, Yanzhi Wang:
Exploring Token Pruning in Vision State Space Models. CoRR abs/2409.18962 (2024)
- [i20]Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang:
Rethinking Token Reduction for State Space Models. CoRR abs/2410.14725 (2024)
- [i19]Pu Zhao, Fei Sun, Xuan Shen, Pinrui Yu, Zhenglun Kong, Yanzhi Wang, Xue Lin:
Pruning Foundation Models for High Accuracy without Retraining. CoRR abs/2410.15567 (2024)
- [i18]Zheng Zhan, Yushu Wu, Yifan Gong, Zichong Meng, Zhenglun Kong, Changdi Yang, Geng Yuan, Pu Zhao, Wei Niu, Yanzhi Wang:
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference. CoRR abs/2411.01171 (2024)
- [i17]Pu Zhao, Xuan Shen, Zhenglun Kong, Yixin Shen, Sung-En Chang, Timothy Rupprecht, Lei Lu, Enfu Nan, Changdi Yang, Yumei He, Xingchen Xu, Yu Huang, Wei Wang, Yue Chen, Yong He, Yanzhi Wang:
Fully Open Source Moxin-7B Technical Report. CoRR abs/2412.06845 (2024)
- 2023
- [c19]Zhenglun Kong, Haoyu Ma, Geng Yuan, Mengshu Sun, Yanyue Xie, Peiyan Dong, Xin Meng, Xuan Shen, Hao Tang, Minghai Qin, Tianlong Chen, Xiaolong Ma, Xiaohui Xie, Zhangyang Wang, Yanzhi Wang:
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training. AAAI 2023: 8360-8368
- [c18]Shengkun Tang, Yaqing Wang, Zhenglun Kong, Tianchi Zhang, Yao Li, Caiwen Ding, Yanzhi Wang, Yi Liang, Dongkuan Xu:
You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model. CVPR 2023: 10781-10791
- [c17]Yifan Gong, Pu Zhao, Zheng Zhan, Yushu Wu, Chao Wu, Zhenglun Kong, Minghai Qin, Caiwen Ding, Yanzhi Wang:
Condense: A Framework for Device and Frequency Adaptive Neural Network Models on the Edge. DAC 2023: 1-6
- [c16]Changdi Yang, Yi Sheng, Peiyan Dong, Zhenglun Kong, Yanyu Li, Pinrui Yu, Lei Yang, Xue Lin:
Late Breaking Results: Fast Fair Medical Applications? Hybrid Vision Models Achieve the Fairness on the Edge. DAC 2023: 1-2
- [c15]Peiyan Dong, Mengshu Sun, Alec Lu, Yanyue Xie, Kenneth Liu, Zhenglun Kong, Xin Meng, Zhengang Li, Xue Lin, Zhenman Fang, Yanzhi Wang:
HeatViT: Hardware-Efficient Adaptive Token Pruning for Vision Transformers. HPCA 2023: 442-455
- [c14]Changdi Yang, Yi Sheng, Peiyan Dong, Zhenglun Kong, Yanyu Li, Pinrui Yu, Lei Yang, Xue Lin, Yanzhi Wang:
Fast and Fair Medical AI on the Edge Through Neural Architecture Search for Hybrid Vision Models. ICCAD 2023: 1-9
- [c13]Peiyan Dong, Zhenglun Kong, Xin Meng, Peng Zhang, Hao Tang, Yanzhi Wang, Chih-Hsien Chou:
SpeedDETR: Speed-aware Transformers for End-to-end Object Detection. ICML 2023: 8227-8243
- [c12]Xuan Shen, Zhenglun Kong, Minghai Qin, Peiyan Dong, Geng Yuan, Xin Meng, Hao Tang, Xiaolong Ma, Yanzhi Wang:
Data Level Lottery Ticket Hypothesis for Vision Transformers. IJCAI 2023: 1378-1386
- [c11]Peiyan Dong, Zhenglun Kong, Xin Meng, Pinrui Yu, Yifan Gong, Geng Yuan, Hao Tang, Yanzhi Wang:
HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception. NeurIPS 2023
- [i16]Lu Yang, Zhenglun Kong, Ting Li, Xinyi Bai, Zhiye Lin, Hong Cheng:
GPU Accelerated Color Correction and Frame Warping for Real-time Video Stitching. CoRR abs/2308.09209 (2023)
- [i15]Xuan Shen, Peiyan Dong, Lei Lu, Zhenglun Kong, Zhengang Li, Ming Lin, Chao Wu, Yanzhi Wang:
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge. CoRR abs/2312.05693 (2023)
- 2022
- [c10]Geng Yuan, Sung-En Chang, Qing Jin, Alec Lu, Yanyu Li, Yushu Wu, Zhenglun Kong, Yanyue Xie, Peiyan Dong, Minghai Qin, Xiaolong Ma, Xulong Tang, Zhenman Fang, Yanzhi Wang:
You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding. ECCV (12) 2022: 34-51
- [c9]Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Xuan Shen, Geng Yuan, Bin Ren, Hao Tang, Minghai Qin, Yanzhi Wang:
SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning. ECCV (11) 2022: 620-640
- [c8]Geng Yuan, Yanyu Li, Sheng Li, Zhenglun Kong, Sergey Tulyakov, Xulong Tang, Yanzhi Wang, Jian Ren:
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training. NeurIPS 2022
- [i14]Geng Yuan, Yanyu Li, Sheng Li, Zhenglun Kong, Sergey Tulyakov, Xulong Tang, Yanzhi Wang, Jian Ren:
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training. CoRR abs/2209.11204 (2022)
- [i13]Xuan Shen, Zhenglun Kong, Minghai Qin, Peiyan Dong, Geng Yuan, Xin Meng, Hao Tang, Xiaolong Ma, Yanzhi Wang:
The Lottery Ticket Hypothesis for Vision Transformers. CoRR abs/2211.01484 (2022)
- [i12]Peiyan Dong, Mengshu Sun, Alec Lu, Yanyue Xie, Kenneth Liu, Zhenglun Kong, Xin Meng, Zhengang Li, Xue Lin, Zhenman Fang, Yanzhi Wang:
HeatViT: Hardware-Efficient Adaptive Token Pruning for Vision Transformers. CoRR abs/2211.08110 (2022)
- [i11]Zhenglun Kong, Haoyu Ma, Geng Yuan, Mengshu Sun, Yanyue Xie, Peiyan Dong, Xin Meng, Xuan Shen, Hao Tang, Minghai Qin, Tianlong Chen, Xiaolong Ma, Xiaohui Xie, Zhangyang Wang, Yanzhi Wang:
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training. CoRR abs/2211.10801 (2022)
- [i10]Shengkun Tang, Yaqing Wang, Zhenglun Kong, Tianchi Zhang, Yao Li, Caiwen Ding, Yanzhi Wang, Yi Liang, Dongkuan Xu:
You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model. CoRR abs/2211.11152 (2022)
- 2021
- [c7]Zhengang Li, Geng Yuan, Wei Niu, Pu Zhao, Yanyu Li, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin:
NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration. CVPR 2021: 14255-14266
- [c6]Shaoyi Huang, Shiyang Chen, Hongwu Peng, Daniel Manu, Zhenglun Kong, Geng Yuan, Lei Yang, Shusen Wang, Hang Liu, Caiwen Ding:
HMC-TRAN: A Tensor-core Inspired Hierarchical Model Compression for Transformer-based DNNs on GPU. ACM Great Lakes Symposium on VLSI 2021: 169-174
- [c5]Panjie Qi, Edwin Hsing-Mean Sha, Qingfeng Zhuge, Hongwu Peng, Shaoyi Huang, Zhenglun Kong, Yuhong Song, Bingbing Li:
Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization. ICCAD 2021: 1-9
- [c4]Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang:
A Compression-Compilation Framework for On-mobile Real-time BERT Applications. IJCAI 2021: 5000-5003
- [c3]Geng Yuan, Zhiheng Liao, Xiaolong Ma, Yuxuan Cai, Zhenglun Kong, Xuan Shen, Jingyan Fu, Zhengang Li, Chengming Zhang, Hongwu Peng, Ning Liu, Ao Ren, Jinhui Wang, Yanzhi Wang:
Improving DNN Fault Tolerance using Weight Pruning and Differential Crossbar Mapping for ReRAM-based Edge AI. ISQED 2021: 135-141
- [c2]Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, Siyue Wang, Minghai Qin, Bin Ren, Yanzhi Wang, Sijia Liu, Xue Lin:
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge. NeurIPS 2021: 20838-20850
- [i9]Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang:
A Compression-Compilation Framework for On-mobile Real-time BERT Applications. CoRR abs/2106.00526 (2021)
- [i8]Geng Yuan, Zhiheng Liao, Xiaolong Ma, Yuxuan Cai, Zhenglun Kong, Xuan Shen, Jingyan Fu, Zhengang Li, Chengming Zhang, Hongwu Peng, Ning Liu, Ao Ren, Jinhui Wang, Yanzhi Wang:
Improving DNN Fault Tolerance using Weight Pruning and Differential Crossbar Mapping for ReRAM-based Edge AI. CoRR abs/2106.09166 (2021)
- [i7]Panjie Qi, Edwin Hsing-Mean Sha, Qingfeng Zhuge, Hongwu Peng, Shaoyi Huang, Zhenglun Kong, Yuhong Song, Bingbing Li:
Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization. CoRR abs/2110.10030 (2021)
- [i6]Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, Siyue Wang, Minghai Qin, Bin Ren, Yanzhi Wang, Sijia Liu, Xue Lin:
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge. CoRR abs/2110.14032 (2021)
- [i5]Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Bin Ren, Minghai Qin, Hao Tang, Yanzhi Wang:
SPViT: Enabling Faster Vision Transformers via Soft Token Pruning. CoRR abs/2112.13890 (2021)
- 2020
- [c1]Bingbing Li, Zhenglun Kong, Tianyun Zhang, Ji Li, Zhengang Li, Hang Liu, Caiwen Ding:
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning. EMNLP (Findings) 2020: 3187-3199
- [i4]Zhengang Li, Yifan Gong, Xiaolong Ma, Sijia Liu, Mengshu Sun, Zheng Zhan, Zhenglun Kong, Geng Yuan, Yanzhi Wang:
SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency. CoRR abs/2001.08839 (2020)
- [i3]Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang:
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization. CoRR abs/2009.06823 (2020)
- [i2]Bingbing Li, Zhenglun Kong, Tianyun Zhang, Ji Li, Zhengang Li, Hang Liu, Caiwen Ding:
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning. CoRR abs/2009.08065 (2020)
- [i1]Zhengang Li, Geng Yuan, Wei Niu, Yanyu Li, Pu Zhao, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin:
6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration. CoRR abs/2012.00596 (2020)
2010 – 2019
- 2017
- [j1]Fulin Zhong, Zhenglun Kong, Guoyi Xu, Ting Li:
High stability and robustness of a developed novel laser acupuncture theranostic device. Microelectron. Reliab. 78: 401-405 (2017)
last updated on 2025-01-20 23:02 CET by the dblp team
all metadata released as open data under CC0 1.0 license