Package

org.opencypher.spark.impl.io

hdfs

Permalink

package hdfs

Visibility
  1. Public
  2. All

Type Members

  1. case class CsvField(name: String, column: Int, valueType: String) extends Product with Serializable

    Permalink
  2. trait CsvFileHandler extends AnyRef

    Permalink
  3. class CsvGraphLoader extends AnyRef

    Permalink

    Loads a graph stored in indexed CSV format from HDFS or the local file system The CSV files must be stored following this schema: # Nodes

    Loads a graph stored in indexed CSV format from HDFS or the local file system The CSV files must be stored following this schema: # Nodes

    • all files describing nodes are stored in a subfolder called "nodes"
    • create one file for each possible label combination that exists in the data, i.e. there must not be overlapping entities in different files (e.g. all nodes with labels :Person:Employee in a single file and all nodes that have label :Person exclusively in another file)
    • for every node csv file create a schema file called FILE_NAME.csv.SCHEMA
    • for information about the structure of the node schema file see CsvNodeSchema # Relationships
    • all files describing relationships are stored in a subfolder called "relationships"
    • create one csv file per relationship type
    • for every relationship csv file create a schema file called FILE_NAME.csv.SCHEMA
    • for information about the structure of the relationship schema file see CsvRelSchema
  4. class CsvGraphWriter extends AnyRef

    Permalink
  5. case class CsvNodeSchema(idField: CsvField, implicitLabels: List[String], optionalLabels: List[CsvField], propertyFields: List[CsvField]) extends CsvSchema with Product with Serializable

    Permalink
  6. case class CsvRelSchema(idField: CsvField, startIdField: CsvField, endIdField: CsvField, relationshipType: String, propertyFields: List[CsvField]) extends CsvSchema with Product with Serializable

    Permalink
  7. abstract class CsvSchema extends AnyRef

    Permalink
  8. final class HadoopFileHandler extends CsvFileHandler

    Permalink
  9. final class LocalFileHandler extends CsvFileHandler

    Permalink

Value Members

  1. object CsvField extends Serializable

    Permalink
  2. object CsvGraphLoader

    Permalink
  3. object CsvGraphWriter

    Permalink
  4. object CsvNodeSchema extends Serializable

    Permalink

    Reads the schema of a node csv file.

    Reads the schema of a node csv file. The schema file is in JSON format and has the following structure: { "idField": { "name": "id", "column": 0, "valueType": "LONG" }, "implicitLabels": ["Person","Employee"], "optionalLabels": [ { "name": "Swede", "column": 3, "valueType": "BOOLEAN" }, { "name": "German", "column": 4, "valueType": "BOOLEAN" } ], "propertyFields": [ { "name": "name", "column": 1, "valueType": "STRING" }, { "name": "luckyNumber", "column": 2, "valueType": "INTEGER" } ] }

  5. object CsvRelSchema extends Serializable

    Permalink

    Reads the schema of a relationship csv file.

    Reads the schema of a relationship csv file. The schema file is in JSON format and has the following structure: { "idField": { "name": "id", "column": 0, "valueType": "LONG" }, "startIdField": { "name": "start", "column": 1, "valueType": "LONG" }, "endIdField": { "name": "end", "column": 2, "valueType": "LONG" }, "relationshipType": "KNOWS", "propertyFields": [ { "name": "since", "column": 3, "valueType": "INTEGER" } ] }

  6. object CsvSchemaUtils

    Permalink

Ungrouped