iBet uBet web content aggregator. Adding the entire web to your favor.

Link to original content: https://dblp.org/rec/conf/osdi/LeeLSS24.rdf

Wonbeom Lee et al.: InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management. (2024) conf/osdi/LeeLSS24 InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management. 4 Wonbeom Lee 1 Jungi Lee 2 Junghwan Seo 3 Jaewoong Sim 4 155-172 OSDI OSDI 2024 2024 provenance information for RDF data of dblp record 'conf/osdi/LeeLSS24' 2024-07-16T22:11:07+0200