Hi,
Could anyone please help me? I have RStudio on my local machine and am running into memory issues when reading a huge data file. I am planning to use SparkR, but I would first like to know the following:
1. Is there any difference between sparklyr and SparkR?
2. Since both are used for Spark integration from R, when should I use which package?
3. Since both have active communities, which one is the better choice?
4. Are there any limitations or challenges with either of the two?
If anyone could provide details on this, that would be great.
Thanks!
The table at the top of the post provides a rough overview of my conclusions. Essentially, sparklyr is already nicer to use than SparkR, even at this early stage of its development.