UnicodeDecodeWithOffsets (TensorFlow Core API Library 0.4.2 API)

java.lang.Object
- org.tensorflow.op.RawOp
- - org.tensorflow.op.strings.UnicodeDecodeWithOffsets<T>

Type Parameters:

T - data type for row_splits output

All Implemented Interfaces:

Op
```
public final class UnicodeDecodeWithOffsets<T extends TNumber>
extends RawOp
```
Decodes each string in input into a sequence of Unicode code points. The character codepoints for all strings are returned using a single vector char_values, with strings expanded to characters in row-major order. Similarly, the character start byte offsets are returned using a single vector char_to_byte_starts, with strings expanded in row-major order.
The row_splits tensor indicates where the codepoints and start offsets for each input string begin and end within the char_values and char_to_byte_starts tensors. In particular, the values for the ith string (in row-major order) are stored in the slice [row_splits[i]:row_splits[i+1]]. Thus:
- char_values[row_splits[i]+j] is the Unicode codepoint for the jth character in the ith string (in row-major order).
- char_to_bytes_starts[row_splits[i]+j] is the start byte offset for the jth character in the ith string (in row-major order).
- row_splits[i+1] - row_splits[i] is the number of characters in the ith string (in row-major order).

Nested Class Summary

Nested Classes
Modifier and Type	Class and Description
`static class`	`UnicodeDecodeWithOffsets.Inputs`
`static class`	`UnicodeDecodeWithOffsets.Options` Optional attributes for `UnicodeDecodeWithOffsets`

Field Summary

Fields
Modifier and Type Field and Description

static String OP_NAME
The name of this op, as known by TensorFlow core engine
- Fields inherited from class org.tensorflow.op.RawOp
  operation

Fields
Modifier and Type	Field and Description
`static String`	`OP_NAME` The name of this op, as known by TensorFlow core engine

Constructor Summary

Constructors
Constructor and Description

UnicodeDecodeWithOffsets(Operation operation)

Constructors
Constructor and Description
`UnicodeDecodeWithOffsets(Operation operation)`

Method Summary

All Methods Static Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`Output<TInt64>`	`charToByteStarts()` Gets charToByteStarts.
`Output<TInt32>`	`charValues()` Gets charValues.
`static <T extends TNumber> UnicodeDecodeWithOffsets<T>`	`create(Scope scope, Operand<TString> input, String inputEncoding, Class<T> Tsplits, UnicodeDecodeWithOffsets.Options... options)` Factory method to create a class wrapping a new UnicodeDecodeWithOffsets operation.
`static UnicodeDecodeWithOffsets<TInt64>`	`create(Scope scope, Operand<TString> input, String inputEncoding, UnicodeDecodeWithOffsets.Options[] options)` Factory method to create a class wrapping a new UnicodeDecodeWithOffsets operation, with the default output types.
`static UnicodeDecodeWithOffsets.Options`	`errors(String errors)` Sets the errors option.
`static UnicodeDecodeWithOffsets.Options`	`replaceControlCharacters(Boolean replaceControlCharacters)` Sets the replaceControlCharacters option.
`static UnicodeDecodeWithOffsets.Options`	`replacementChar(Long replacementChar)` Sets the replacementChar option.
`Output<T>`	`rowSplits()` Gets rowSplits.

Methods inherited from class org.tensorflow.op.RawOp
equals, hashCode, op, toString

Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait

Methods inherited from interface org.tensorflow.op.Op
env

- Field Detail
  - OP_NAME
```
public static final String OP_NAME
```
    The name of this op, as known by TensorFlow core engine
    
    See Also:
    
    Constant Field Values
- Constructor Detail
  - UnicodeDecodeWithOffsets
```
public UnicodeDecodeWithOffsets(Operation operation)
```
- Method Detail
  - create
```
@Endpoint(describeByClass=true)
public static <T extends TNumber> UnicodeDecodeWithOffsets<T> create(Scope scope,
                                                                                                     Operand<TString> input,
                                                                                                     String inputEncoding,
                                                                                                     Class<T> Tsplits,
                                                                                                     UnicodeDecodeWithOffsets.Options... options)
```
    Factory method to create a class wrapping a new UnicodeDecodeWithOffsets operation.
    
    Type Parameters:
    
    T - data type for UnicodeDecodeWithOffsets output and operands
    
    Parameters:
    
    scope - current scope
    
    input - The text to be decoded. Can have any shape. Note that the output is flattened to a vector of char values.
    
    inputEncoding - Text encoding of the input strings. This is any of the encodings supported by ICU ucnv algorithmic converters. Examples: "UTF-16", "US ASCII", "UTF-8".
    
    Tsplits - The value of the Tsplits attribute
    
    options - carries optional attribute values
    
    Returns:
    
    a new instance of UnicodeDecodeWithOffsets
  - create
```
@Endpoint(describeByClass=true)
public static UnicodeDecodeWithOffsets<TInt64> create(Scope scope,
                                                                                      Operand<TString> input,
                                                                                      String inputEncoding,
                                                                                      UnicodeDecodeWithOffsets.Options[] options)
```
    Factory method to create a class wrapping a new UnicodeDecodeWithOffsets operation, with the default output types.
    
    Parameters:
    
    scope - current scope
    
    input - The text to be decoded. Can have any shape. Note that the output is flattened to a vector of char values.
    
    inputEncoding - Text encoding of the input strings. This is any of the encodings supported by ICU ucnv algorithmic converters. Examples: "UTF-16", "US ASCII", "UTF-8".
    
    options - carries optional attribute values
    
    Returns:
    
    a new instance of UnicodeDecodeWithOffsets, with default output types
  - errors
```
public static UnicodeDecodeWithOffsets.Options errors(String errors)
```
    Sets the errors option.
    
    Parameters:
    
    errors - Error handling policy when there is invalid formatting found in the input. The value of 'strict' will cause the operation to produce a InvalidArgument error on any invalid input formatting. A value of 'replace' (the default) will cause the operation to replace any invalid formatting in the input with the replacement_char codepoint. A value of 'ignore' will cause the operation to skip any invalid formatting in the input and produce no corresponding output character.
    
    Returns:
    
    this Options instance.
  - replacementChar
```
public static UnicodeDecodeWithOffsets.Options replacementChar(Long replacementChar)
```
    Sets the replacementChar option.
    
    Parameters:
    
    replacementChar - The replacement character codepoint to be used in place of any invalid formatting in the input when errors='replace'. Any valid unicode codepoint may be used. The default value is the default unicode replacement character is 0xFFFD or U+65533.)
    
    Returns:
    
    this Options instance.
  - replaceControlCharacters
```
public static UnicodeDecodeWithOffsets.Options replaceControlCharacters(Boolean replaceControlCharacters)
```
    Sets the replaceControlCharacters option.
    
    Parameters:
    
    replaceControlCharacters - Whether to replace the C0 control characters (00-1F) with the replacement_char. Default is false.
    
    Returns:
    
    this Options instance.
  - rowSplits
```
public Output<T> rowSplits()
```
    Gets rowSplits. A 1D int32 tensor containing the row splits.
    
    Returns:
    
    rowSplits.
  - charValues
```
public Output<TInt32> charValues()
```
    Gets charValues. A 1D int32 Tensor containing the decoded codepoints.
    
    Returns:
    
    charValues.
  - charToByteStarts
```
public Output<TInt64> charToByteStarts()
```
    Gets charToByteStarts. A 1D int32 Tensor containing the byte index in the input string where each character in char_values starts.
    
    Returns:
    
    charToByteStarts.

Class UnicodeDecodeWithOffsets<T extends TNumber>

Nested Class Summary

Field Summary

Fields inherited from class org.tensorflow.op.RawOp

Constructor Summary

Method Summary

Methods inherited from class org.tensorflow.op.RawOp

Methods inherited from class java.lang.Object

Methods inherited from interface org.tensorflow.op.Op

Field Detail

OP_NAME

Constructor Detail

UnicodeDecodeWithOffsets

Method Detail

create

create

errors

replacementChar

replaceControlCharacters

rowSplits

charValues

charToByteStarts