Reading Avro files from HDFS

If you want to read Avro files from HDFS and you’re using schema-generated classes instead of GenericRecords, you’ll have to use the specific datum reader (SpecificDatumReader).

    // FsInput: org.apache.avro.mapred; other types: org.apache.avro.file, .io, .specific
    SeekableInput input = new FsInput(path, getConfiguration());
    DatumReader<SpecificSchemaClass> reader = new SpecificDatumReader<SpecificSchemaClass>();
    FileReader<SpecificSchemaClass> fileReader = DataFileReader.openReader(input, reader);
    while (fileReader.hasNext()) {
        SpecificSchemaClass event = fileReader.next();
        // process the event
    }
    fileReader.close();

So it’s basically as easy as reading GenericRecords: the only change is using SpecificDatumReader in place of GenericDatumReader.

Don’t forget to add the dependencies if you’re using Maven:
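A minimal sketch of the relevant pom.xml entries, assuming the standard Apache Avro and Hadoop artifacts. The version properties (avro.version, hadoop.version) are placeholders, so set them to whatever matches your cluster. FsInput lives in avro-mapred, while Path and Configuration come from Hadoop; some older avro-mapred releases may also need a Hadoop-specific classifier.

    <dependencies>
      <!-- Core Avro: DataFileReader, SpecificDatumReader, ... -->
      <dependency>
        <groupId>org.apache.avro</groupId>
        <artifactId>avro</artifactId>
        <version>${avro.version}</version> <!-- placeholder -->
      </dependency>
      <!-- Provides org.apache.avro.mapred.FsInput -->
      <dependency>
        <groupId>org.apache.avro</groupId>
        <artifactId>avro-mapred</artifactId>
        <version>${avro.version}</version> <!-- placeholder -->
      </dependency>
      <!-- Hadoop client classes: Path, Configuration, FileSystem -->
      <dependency>
        <groupId>org.apache.hadoop</groupId>
        <artifactId>hadoop-client</artifactId>
        <version>${hadoop.version}</version> <!-- placeholder -->
      </dependency>
    </dependencies>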



