This commit is contained in:
Ralph Straumann
2023-05-25 17:31:24 +02:00
committed by GitHub
parent 8d4fed1da0
commit 476f5c01b6

View File

@@ -140,7 +140,7 @@ This means that:
This means that they're organized column-by-column, much like R's data frame.
This typically leads to better performance for data analysis tasks compared to CSV files, which are organized row-by-row.
- Parquet files are "chunked", which makes it possible to work on different parts of the file at the same time, and, if you're lucky, to skip some chunks all together.
- Parquet files are "chunked", which makes it possible to work on different parts of the file at the same time, and, if you're lucky, to skip some chunks altogether.
There's one primary disadvantage to parquet files: they are no longer "human readable", i.e. if you look at a parquet file using `readr::read_file()`, you'll just see a bunch of gibberish.