Google open-sources speedy DiffusionGemma text diffusion model
Google says that DiffusionGemma can generate more than 1,000 tokens per second when running on a single H100, a server-grade GPU that Nvidia Corp. launched in 2022. The model can generate over 700 ...
2 days ago