Wonbeom Lee et al.: InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management. (2024)conf/osdi/LeeLSS24InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management.4Wonbeom Lee1Jungi Lee2Junghwan Seo3Jaewoong Sim4155-172OSDIOSDI20242024provenance information for RDF data of dblp record 'conf/osdi/LeeLSS24'2024-07-16T22:11:07+0200