Combining Large and Small LLMs to Boost Inference Time and Quality | Towards Data Science

Implementing Speculative and Contrastive Decoding

By · · 1 min read
Combining Large and Small LLMs to Boost Inference Time and Quality | Towards Data Science

Source: Towards Data Science

Implementing Speculative and Contrastive Decoding