David McGinnisMar 17, 20206 minTesting a Hive Patch on a Local System[...] I needed to get a Hive cluster running my code and a Confluent cluster that could output Avro messages in the proper format to test.
David McGinnisMar 10, 20204 minA Crash Course in Proper Oozie Usage[...] focus on best practices such as when and why you should use Oozie, and when to use bundles.
David McGinnisFeb 25, 20207 minDebugging From The Field: The Case of the Empty FilesA team at a client was using Spark to read and write to a Kafka topic. [...] files that would be written that were completely empty.
David McGinnisDec 3, 20195 minDebugging From The Field: The Case of the Ignored Configuration ChangeWe made the change on a Sunday, but four days later, the number of files had not appreciably changed in the YARN logs directory.