Hardwood: A New Parser for Apache Parquet

(morling.dev)

35 points | by rmoff 2 days ago

2 comments

  • willtemperley 47 minutes ago
    Respect for doing this. I recently implemented a Parquet reader in Swift using parquet-java as a reference and it was by a long way the hardest bit of coding I’ve done. Your bit unpacking is interesting, is it faster then the 74 KLOC parquet-java bit unpacker?
  • uwemaurer 2 hours ago
    Great! I will give it a try. I found that using DuckDB to select from the parquet files and using the Apache Arrow API to read the results is also a very fast method