Hi, I'm trying to use ExtMemQuantileDMatrix for training huge dateset on gpus. For example, training 1Tb raw fp32 dataset on 4/8xRTX 4090 (24G) + 2/4Tb memory (which is sufficent for the same dataset ...
TL;DR: After two back-to-back langgraph invocations on a warm server, heap never returns to baseline. Biggest deltas: strings +~2.3 MB (duplicate prompts/config blobs) and compiled code +~3.5 MB ...
Abstract: LLVM frontends like Clang preserve source-level type information in the intermediate representation (IR) primarily through debug metadata. Although intended for debuggers, this metadata ...