-
Notifications
You must be signed in to change notification settings - Fork 0
/
README.Rmd
52 lines (34 loc) · 1.04 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
---
output: rmarkdown::github_document
---
# sparrow
Temporary Shorcut For Reading Arrow/Parquet Bit Into R via 'reticulate'
## Description
Work is being done to make Parquet/Arrow a first-class R citizen
but -- until then -- I don't always want a Drill server round trip just
to read in some data and same goes for firing up a Spark instance (srsly).
So, this is a quick hack until the R packages are done.
## NOTE
**Requires** Python 3.5+, `pyarrow` and `pandas`.
## What's Inside The Tin
The following functions are implemented:
- `read_parquet`: Read in data from Parquet into an R data frame via 'reticulate'
## Installation
```{r eval=FALSE}
devtools::install_github("hrbrmstr/sparrow")
```
```{r message=FALSE, warning=FALSE, error=FALSE, include=FALSE}
options(width=120)
```
## Usage
```{r message=FALSE, warning=FALSE, error=FALSE}
library(sparrow)
# current verison
packageVersion("sparrow")
```
```{r cache=TRUE}
read_parquet("/tmp/honeypot.parquet")
```
```{r cache=TRUE}
read_parquet("/tmp/honeypot.parquet", c("src", "duration"))
```