Class ParquetAvroDataFormat

All Implemented Interfaces:
CopyableDefinition<DataFormatDefinition>

@Metadata(firstVersion="4.0.0", label="dataformat,transformation,file", title="Parquet File")
public class ParquetAvroDataFormat extends DataFormatDefinition
Parquet Avro serialization and de-serialization.
  • Constructor Details

    • ParquetAvroDataFormat

      public ParquetAvroDataFormat()
    • ParquetAvroDataFormat

      protected ParquetAvroDataFormat(ParquetAvroDataFormat source)
    • ParquetAvroDataFormat

      public ParquetAvroDataFormat(String unmarshalTypeName)
    • ParquetAvroDataFormat

      public ParquetAvroDataFormat(Class<?> unmarshalType)
    • ParquetAvroDataFormat

      public ParquetAvroDataFormat(boolean lazyLoad)
  • Method Details

    • copyDefinition

      public ParquetAvroDataFormat copyDefinition()
      Description copied from interface: CopyableDefinition
      Makes a copy of this definition.
      Specified by:
      copyDefinition in interface CopyableDefinition<DataFormatDefinition>
      Overrides:
      copyDefinition in class DataFormatDefinition
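The protected copy constructor and copyDefinition() listed above suggest the usual copy-constructor pattern. The following is a minimal sketch of that pattern, not the Camel source: FormatSketch and its two fields are hypothetical stand-ins, and whether the real class copies fields this way is an assumption.

```java
// Hypothetical stand-in for a definition class; not ParquetAvroDataFormat itself.
class FormatSketch {
    private String compressionCodecName;
    private String lazyLoad;

    FormatSketch() {
    }

    // Copy constructor: duplicates every configured field from the source.
    protected FormatSketch(FormatSketch source) {
        this.compressionCodecName = source.compressionCodecName;
        this.lazyLoad = source.lazyLoad;
    }

    // Same shape as copyDefinition(): returns a new, independent instance.
    public FormatSketch copyDefinition() {
        return new FormatSketch(this);
    }

    public String getCompressionCodecName() {
        return compressionCodecName;
    }

    public void setCompressionCodecName(String compressionCodecName) {
        this.compressionCodecName = compressionCodecName;
    }

    public String getLazyLoad() {
        return lazyLoad;
    }

    public void setLazyLoad(String lazyLoad) {
        this.lazyLoad = lazyLoad;
    }
}
```

With this shape, changing the copy leaves the original untouched, which is the property a definition copy usually needs.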
    • setCompressionCodecName

      public void setCompressionCodecName(String compressionCodecName)
      Compression codec to use when marshalling.
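Because the codec is passed as a plain String, a caller may want to validate it before configuring the data format. The sketch below is one hedged way to do that. CodecNameSketch and normalize are hypothetical helpers, and the list of names is assumed to mirror Parquet's CompressionCodecName enum; check it against the Parquet version in use.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper; not part of Camel or Parquet.
class CodecNameSketch {
    // Assumed codec names; verify against the Parquet version on the classpath.
    private static final Set<String> KNOWN = new HashSet<>(Arrays.asList(
            "UNCOMPRESSED", "SNAPPY", "GZIP", "LZO", "BROTLI", "LZ4", "ZSTD", "LZ4_RAW"));

    // Trims and upper-cases the name, rejecting anything outside the assumed set.
    static String normalize(String name) {
        if (name == null) {
            throw new IllegalArgumentException("codec name is required");
        }
        String upper = name.trim().toUpperCase(Locale.ROOT);
        if (!KNOWN.contains(upper)) {
            throw new IllegalArgumentException("unknown codec: " + name);
        }
        return upper;
    }
}
```

The normalized value could then be handed to setCompressionCodecName(String).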
    • getCompressionCodecName

      public String getCompressionCodecName()
    • getUnmarshalType

      public Class<?> getUnmarshalType()
    • setUnmarshalType

      public void setUnmarshalType(Class<?> unmarshalType)
      Class to use when unmarshalling.
    • setUnmarshalTypeName

      public void setUnmarshalTypeName(String unmarshalTypeName)
      Name of the class to use when (un)marshalling. If omitted, Parquet files are unmarshalled into Avro GenericRecord instances, and input objects are assumed to be GenericRecord instances when marshalling.
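The fallback described above (no type name means generic records) can be sketched as a small resolution step. This is an illustration only: UnmarshalTypeSketch and resolve are hypothetical names, and the use of Class.forName is an assumption about how a type name might be turned into a Class, not a statement about Camel's class loading.

```java
import java.util.Optional;

// Hypothetical helper; not part of Camel.
class UnmarshalTypeSketch {
    // Returns the named class, or empty when no name is configured.
    // An empty result stands for "fall back to generic records".
    static Optional<Class<?>> resolve(String unmarshalTypeName) {
        if (unmarshalTypeName == null || unmarshalTypeName.trim().isEmpty()) {
            return Optional.empty();
        }
        try {
            Class<?> type = Class.forName(unmarshalTypeName.trim());
            return Optional.of(type);
        } catch (ClassNotFoundException e) {
            // Surface a misconfigured name as an unchecked error.
            throw new IllegalArgumentException("cannot load class: " + unmarshalTypeName, e);
        }
    }
}
```

A caller would branch on the Optional: a present value selects the typed path, an empty one the generic-record path.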
    • getUnmarshalTypeName

      public String getUnmarshalTypeName()
    • getLazyLoad

      public String getLazyLoad()
    • setLazyLoad

      public void setLazyLoad(String lazyLoad)
      Whether the unmarshalling should produce an iterator of records or read all the records at once.
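The two result shapes named in the lazyLoad description, an iterator of records versus all records at once, can be contrasted in a few lines. This is a sketch under stated assumptions: LazyLoadSketch is a hypothetical stand-in, records are modelled as arbitrary objects rather than Avro records, and it is assumed that a true flag selects the iterator.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in for the unmarshalling step; not the Camel implementation.
class LazyLoadSketch {
    // Eager mode: drain the source so every record is in memory at once.
    static <T> List<T> readAll(Iterator<T> source) {
        List<T> records = new ArrayList<>();
        source.forEachRemaining(records::add);
        return records;
    }

    // Assumption: lazyLoad == true hands back the iterator untouched,
    // so records are only read as the caller advances it.
    static <T> Object unmarshal(Iterator<T> source, boolean lazyLoad) {
        if (lazyLoad) {
            return source;
        }
        return readAll(source);
    }
}
```

The iterator form keeps memory use bounded for large files, while the list form is simpler when the whole result is needed anyway. Note that setLazyLoad takes a String whereas one constructor takes a boolean, so callers of the setter pass a value such as "true".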