Class RealtimeTruncationRetentionRatio

  • All Implemented Interfaces:

    
    public final class RealtimeTruncationRetentionRatio
    
                        

    Retain a fraction of the conversation tokens when the conversation exceeds the input token limit. This allows you to amortize truncations across multiple turns, which can help improve cached token usage.