Picture for Bohan Li

Bohan Li

Towards General Discrete Speech Codec for Complex Acoustic Environments: A Study of Reconstruction and Downstream Task Consistency

Add code
May 28, 2025
Viaarxiv icon

Challenger: Affordable Adversarial Driving Video Generation

Add code
May 21, 2025
Viaarxiv icon

Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism

Add code
May 20, 2025
Viaarxiv icon

UAV-Enabled Joint Sensing, Communication, Powering and Backhaul Transmission in Maritime Monitoring Networks

Add code
May 18, 2025
Viaarxiv icon

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation

Add code
Mar 19, 2025
Viaarxiv icon

MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction

Add code
Mar 13, 2025
Viaarxiv icon

Joint Beamforming and Compressed Sensing for Uplink Grant-Free Access

Add code
Mar 09, 2025
Viaarxiv icon

Recent Advances in Discrete Speech Tokens: A Review

Add code
Feb 10, 2025
Viaarxiv icon

LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding

Add code
Dec 24, 2024
Figure 1 for LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding
Figure 2 for LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding
Figure 3 for LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding
Figure 4 for LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding
Viaarxiv icon

Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits

Add code
Dec 17, 2024
Viaarxiv icon