Fix feather build failure
And add a bit more on other types of data
This commit is contained in:
parent
0bd0021537
commit
436d5786b9
|
@ -12,7 +12,6 @@ Imports:
|
||||||
broom,
|
broom,
|
||||||
dplyr,
|
dplyr,
|
||||||
DSR,
|
DSR,
|
||||||
feather,
|
|
||||||
gapminder,
|
gapminder,
|
||||||
ggplot2,
|
ggplot2,
|
||||||
hexbin,
|
hexbin,
|
||||||
|
|
29
import.Rmd
29
import.Rmd
|
@ -567,10 +567,20 @@ This makes csvs a little unreliable for caching interim results - you need to re
|
||||||
1. The feather package implements a fast binary file format that can
|
1. The feather package implements a fast binary file format that can
|
||||||
be shared across programming languages:
|
be shared across programming languages:
|
||||||
|
|
||||||
```{r}
|
```{r, eval = FALSE}
|
||||||
library(feather)
|
library(feather)
|
||||||
write_feather(challenge, "challenge.feather")
|
write_feather(challenge, "challenge.feather")
|
||||||
read_feather("challenge.feather")
|
read_feather("challenge.feather")
|
||||||
|
#> # A tibble: 2,000 x 2
|
||||||
|
#> x y
|
||||||
|
#> <dbl> <date>
|
||||||
|
#> 1 404 <NA>
|
||||||
|
#> 2 4172 <NA>
|
||||||
|
#> 3 3004 <NA>
|
||||||
|
#> 4 787 <NA>
|
||||||
|
#> 5 37 <NA>
|
||||||
|
#> 6 2332 <NA>
|
||||||
|
#> # ... with 1,994 more rows
|
||||||
```
|
```
|
||||||
|
|
||||||
feather tends to be faster than rds and is usable outside of R. `rds` supports list-columns (which you'll learn about in [[Many models]]), which feather does not yet.
|
feather tends to be faster than rds and is usable outside of R. `rds` supports list-columns (which you'll learn about in [[Many models]]), which feather does not yet.
|
||||||
|
@ -578,19 +588,24 @@ feather tends to be faster than rds and is usable outside of R. `rds` supports l
|
||||||
```{r, include = FALSE}
|
```{r, include = FALSE}
|
||||||
file.remove("challenge-2.csv")
|
file.remove("challenge-2.csv")
|
||||||
file.remove("challenge.rds")
|
file.remove("challenge.rds")
|
||||||
file.remove("challenge.feather")
|
|
||||||
```
|
```
|
||||||
|
|
||||||
## Other types of data
|
## Other types of data
|
||||||
|
|
||||||
We have worked on a number of packages to make importing data into R as easy as possible. These packages are certainly not perfect, but they are the best place to start because they behave as similar as possible to readr.
|
To get other types of data into R, we recommend starting with the packages listed below. They're certainly not perfect, but they are a good place to start as they are fully fledged members of the tidyverse.
|
||||||
|
|
||||||
Two packages helper
|
For rectanuglar data:
|
||||||
|
|
||||||
* haven reads files from other SPSS, Stata, and SAS files.
|
* haven reads SPSS, Stata, and SAS files.
|
||||||
|
|
||||||
* readxl reads excel files (both `.xls` and `.xlsx`).
|
* readxl reads excel files (both `.xls` and `.xlsx`).
|
||||||
|
|
||||||
There are two common forms of hierarchical data: XML and json. We recommend using xml2 and jsonlite respectively. These packages are performant, safe, and (relatively) easy to use. To work with these effectively in R, you'll need to x
|
* DBI, along with a database specific backend (e.g. RMySQL, RSQLite,
|
||||||
|
RPostgreSQL etc) allows you to run SQL queries against a database
|
||||||
|
and return a data frame.
|
||||||
|
|
||||||
If your data lives in a database, you'll need to use the DBI package. DBI provides a common interface that works with many different types of database. R's support is particularly good for open source databases (e.g. RPostgres, RMySQL, RSQLite, MonetDBLite).
|
For hierarchical data:
|
||||||
|
|
||||||
|
* jsonlite (by Jeroen Ooms) reads json
|
||||||
|
|
||||||
|
* xml2 reads XML.
|
||||||
|
|
Loading…
Reference in New Issue