Class RealtimeTruncation

    public final class RealtimeTruncation
    
                        

    Controls how the realtime conversation is truncated prior to model inference. The default is auto. When set to retention_ratio, the server retains a fraction of the conversation tokens prior to the instructions.
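To make the retention_ratio idea concrete, here is a minimal sketch of one way a ratio-based truncation could work. It is not the server's actual algorithm and does not use this class's API: the class name RetentionRatioSketch, the method retainRecent, the per-item token counts, and the choice to keep the most recent items are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of the SDK and not the server's implementation.
public class RetentionRatioSketch {

    /**
     * Keeps the most recent conversation items whose combined token count
     * fits within retentionRatio * (total tokens in the conversation).
     *
     * @param itemTokens     token count of each conversation item, oldest first (assumed input shape)
     * @param retentionRatio fraction of tokens to retain, between 0.0 and 1.0 inclusive
     * @return token counts of the retained items, oldest first
     */
    static List<Integer> retainRecent(List<Integer> itemTokens, double retentionRatio) {
        if (retentionRatio < 0.0 || retentionRatio > 1.0) {
            throw new IllegalArgumentException("retentionRatio must be in [0.0, 1.0]");
        }

        long total = 0;
        for (int tokens : itemTokens) {
            total += tokens;
        }

        // Token budget left after truncation.
        long budget = (long) Math.floor(total * retentionRatio);

        // Walk backwards from the newest item, keeping whole items while they fit.
        int start = itemTokens.size();
        long kept = 0;
        while (start > 0 && kept + itemTokens.get(start - 1) <= budget) {
            kept += itemTokens.get(start - 1);
            start--;
        }

        return new ArrayList<>(itemTokens.subList(start, itemTokens.size()));
    }

    public static void main(String[] args) {
        // Four items totalling 500 tokens; a ratio of 0.5 gives a 250-token budget.
        List<Integer> conversation = List.of(120, 80, 200, 100);
        System.out.println(retainRecent(conversation, 0.5)); // prints [100]
    }
}
```

In this sketch whole items are dropped rather than split, so the retained total can fall below the budget: with a 250-token budget, the 200-token item does not fit alongside the newest 100-token item. Whether the server drops whole items, and from which end, is not stated in the description above.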