← 返回首页
GitHub - BurnWan/RTPrune: [ICML26]RTPrune is a two-stage, training-free visual token pruning framework for DeepSeek-OCR that mimics the LLM’s reading-twice behavior and adopts a dynamic pruning ratio, achieving efficient inference while preserving high OCR accuracy. · GitHub
Skip to content

Navigation Menu

Toggle navigation
Sign in
Appearance settings
Search or jump to...

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Appearance settings
Resetting focus

BurnWan/RTPrune

Go to file
Code

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
View all files

Repository files navigation

RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference

🔥News

  • [2026.05.01] 🎉 Our training-free inference acceleration method RTPrune has been accepted at ICML 2026.

✨Highlights

  1. Our RTPrune consistently outperforms prior token pruning methods on DeepSeek-OCR, retaining over 97.88% of accuracy with 84% of visual tokens on olmOCR-Bench.
  2. Our RTPrune reduces GFLOPs by nearly 15.29% and prefill time by nearly 18.90% on OmniDocBench when maintaining 99.47% accuracy.

🌈Method

  1. We introduce RTPrune, a plug-and-play visual token pruning method in DeepSeek-OCR which mimics the reading twice behavior of the LLM via a two-stage pipeline: retaining high-norm tokens and then merging the remaining ones via optimal transport.
  2. We propose a dynamic pruning strategy to enable a better efficiency–accuracy trade-off, which combines the post-encoding inter-token similarity and the original textual density of the image.

📦Installation

  1. Install the DeepSeek-OCR environment.

  2. Download the ckpt files from huggingface and put them in ./DeepSeek-OCR/DeepSeek-OCR-master/DeepSeek-OCR-ckpt.

  3. Replace the corresponding files or add new files with our code and the added part can be searched by "[modified]".

🚀Quick Start

Run the following command:

cd DeepSeek-OCR/DeepSeek-OCR-master/DeepSeek-OCR-hf python run_dpsk_ocr.py

📊Evaluation

👏Acknowledgement

  • This work is built upon DeepSeek-OCR. We thank them for their excellent open-source contributions.

  • We also thank VisionZip, DivPrune, CDPruner, and others for their contributions, which have provided valuable insights.

About

[ICML26]RTPrune is a two-stage, training-free visual token pruning framework for DeepSeek-OCR that mimics the LLM’s reading-twice behavior and adopts a dynamic pruning ratio, achieving efficient inference while preserving high OCR accuracy.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

Footer

© 2026 GitHub, Inc.