Search Results for: String instruments
columnar format notes the format gets new datatypes : largelist(arrow- ), largebinary and largestring (arrow- ). largelist is similar to list but with -bit offsets instead of -bit. the same relationship holds for largebinary and largestring with respect to binary and string. since the last major release
. for example, an array such as ['apple', 'orange', 'apple', null, 'orange', 'orange'] has dictionary-encoded form dictionary: ['apple', 'orange'] indices: [ , , , null, , ] the parquet format uses dictionary encoding to compress data, and it is used for all parquet data types, not just binary or string...
https://arrow.apache.org/blog/