Update arrow.qmd (#1568)

This commit is contained in:
Kevin Tappe
2023-08-24 15:26:20 +02:00
committed by GitHub
parent 97577936db
commit b95d846283

View File

@@ -69,8 +69,8 @@ curl::multi_download(
## Opening a dataset ## Opening a dataset
Let's start by taking a look at the data. Let's start by taking a look at the data.
At 9GB, this file is large enough that we probably don't want to load the whole thing into memory. At 9 GB, this file is large enough that we probably don't want to load the whole thing into memory.
A good rule of thumb is that you usually want at least twice as much memory as the size of the data, and many laptops top out at 16 Gb. A good rule of thumb is that you usually want at least twice as much memory as the size of the data, and many laptops top out at 16 GB.
This means we want to avoid `read_csv()` and instead use the `arrow::open_dataset()`: This means we want to avoid `read_csv()` and instead use the `arrow::open_dataset()`:
```{r open-dataset} ```{r open-dataset}