Adds chapter about arrow (#1137)

Co-authored-by: Neal Richardson <neal.p.richardson@gmail.com>
Co-authored-by: Hadley Wickham <hadley@posit.co>
Co-authored-by: Mine Cetinkaya-Rundel <cetinkaya.mine@gmail.com>
Danielle Navarro
2022-12-08 12:43:11 +11:00
committed by GitHub
parent 3c4ee847e0
commit 281005a31c
8 changed files with 328 additions and 11 deletions


@@ -413,7 +413,7 @@ read_csv("students-2.csv")
```
This makes CSVs a little unreliable for caching interim results---you need to recreate the column specification every time you load the data.
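To make the point concrete, here is a minimal sketch of how a column type can be lost in a CSV round trip but survive in a binary format. It is an illustration under stated assumptions, not the chapter's own example: the `students` data frame is made up, and it uses base R's `write.csv()`, `read.csv()`, `saveRDS()`, and `readRDS()` so that it runs without extra packages.

```r
# Hypothetical example data: a factor column and an integer column.
students <- data.frame(
  meal_plan = factor(c("Lunch only", "Breakfast and lunch", "Lunch only")),
  age = c(4L, 5L, 6L)
)

# Temporary files, so the sketch leaves nothing behind in the project.
csv_path <- tempfile(fileext = ".csv")
rds_path <- tempfile(fileext = ".rds")

# Save the same data in both formats.
write.csv(students, csv_path, row.names = FALSE)
saveRDS(students, rds_path)

# Read both back. stringsAsFactors = FALSE is the default in R >= 4.0;
# it is spelled out here so older versions behave the same way.
from_csv <- read.csv(csv_path, stringsAsFactors = FALSE)
from_rds <- readRDS(rds_path)

# The CSV stores only text, so the factor comes back as a plain
# character vector; the RDS file keeps the original column type.
class(from_csv$meal_plan)
class(from_rds$meal_plan)
```

Restoring the factor after reading the CSV means re-declaring the column type by hand on every load, which is the extra step a binary format avoids.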
There are two main options:
There are two main alternatives:
1. `write_rds()` and `read_rds()` are uniform wrappers around the base functions `saveRDS()` and `readRDS()`.
These store data in R's custom binary format called RDS:
@@ -423,7 +423,8 @@ There are two main options:
read_rds("students.rds")
```
2. The arrow package allows you to read and write parquet files, a fast binary file format that can be shared across programming languages:
2. The arrow package allows you to read and write parquet files, a fast binary file format that can be shared across programming languages.
We'll come back to arrow in more depth in @sec-arrow.
```{r}
#| eval: false
@@ -442,7 +443,7 @@ There are two main options:
#> 6 6 Güvenç Attila Ice cream Lunch only 6
```
Parquet tends to be much faster than RDS and is usable outside of R, but does require you install the arrow package.
Parquet tends to be much faster than RDS and is usable outside of R, but does require the arrow package.
```{r}
#| include: false