Example of sharing a tibble between R and Python/pandas?

RussellPierce · September 5, 2019, 5:38pm

I'd like to see a proof of concept of moving data from R to pandas and back using Apache Arrow. Something similar was asked not too long ago ("Apache Arrow: shared in-memory data object"), but I thought I'd jump in and see if there are any updates, worked examples, or clearer explanations.

tom_rstudio · September 6, 2019, 4:34pm

Hey Russell!

I passed this link along to some of our Solutions Engineers to take a look and see if we can get some examples!

EconomiCurtis · September 6, 2019, 5:00pm

Neal on the Arrow team pointed out an issue they are working on.

https://issues.apache.org/jira/browse/ARROW-3750

That would be a good item to watch, comment on, and/or contribute to.

RussellPierce · September 6, 2019, 11:03pm

Thanks! An example of zero copy with reticulate would be awesome too. I'll follow that issue with interest!

For now, though, I'd settle for an example that manually passes a memory address between entirely separate R and Python processes, in particular for a whole table/tibble (most of the examples I've seen so far cover only single arrays and atomic items). It looks like a fair bit of boilerplate would need to be written for that, too.
