Search Results for: Eggs not for incubation
quick start user story say we have two data set(demo_src, demo_tgt), we need to know what is the data quality for target data set, based on source data set. for simplicity, suppose both two data set have the same schema as this: id bigint age int desc string dt string hour string both dt and hour are
partitions, as every day we have one daily partition dt(like ), for every day we have hourly partitions(like , , , ..., ). environment preparation you need to prepare the environment for apache griffin measure module, including the following software: jdk ( +) hadoop ( . +) spark ( . +) hive ( . ) build...
https://griffin.apache.org/docs/quickstart.html
quick start user story say we have two data set(demo_src, demo_tgt), we need to know what is the data quality for target data set, based on source data set. for simplicity, suppose both two data set have the same schema as this: id bigint age int desc string dt string hour string both dt and hour are
partitions, as every day we have one daily partition dt(like ), for every day we have hourly partitions(like , , , ..., ). environment preparation you need to prepare the environment for apache griffin measure module, including the following software: jdk ( +) hadoop ( . +) spark ( . +) hive ( . ) build...
http://griffin.apache.org/docs/quickstart.html