With the exponential growth of video traffic, traditional video streaming
systems are approaching their limits in compression efficiency and
communication capacity. To further reduce bitrate while maintaining quality, we
propose Promptus, a disruptive novel system that streaming prompts instead of
video content with Stable Diffusion, which converts video frames into a series
of "prompts" for delivery. To ensure pixel alignment, a gradient descent-based
prompt fitting framework is proposed. To achieve adaptive bitrate for prompts,
a low-rank decomposition-based bitrate control algorithm is introduced. For
inter-frame compression of prompts, a temporal smoothing-based prompt
interpolation algorithm is proposed. Evaluations across various video domains
and real network traces demonstrate Promptus can enhance the perceptual quality
by 0.111 and 0.092 (in LPIPS) compared to VAE and H.265, respectively, and
decreases the ratio of severely distorted frames by 89.3% and 91.7%. Moreover,
Promptus achieves real-time video generation from prompts at over 150 FPS. To
the best of our knowledge, Promptus is the first attempt to replace video
codecs with prompt inversion and the first to use prompt streaming instead of
video streaming. Our work opens up a new paradigm for efficient video
communication beyond the Shannon limit