This 2-day workshop covers how to analyze large amounts of data in R. We will focus on scaling up our analyses using the same dplyr verbs that we use in our everyday work, applying dplyr with data.table, databases, and Spark. We will also cover best practices for visualizing, modeling, and sharing results against these data sources. Where applicable, we will review recommended connection settings, security best practices, and deployment options.
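To illustrate the "same dplyr verbs" idea, here is a minimal sketch (the table and column names are hypothetical, not from the workshop materials): the identical pipeline runs against a local data frame, and with dbplyr or sparklyr the same verbs are translated to SQL or Spark behind the scenes.

```r
library(dplyr)

# Hypothetical local data frame standing in for a larger remote table
flights <- data.frame(
  carrier   = c("AA", "AA", "DL"),
  dep_delay = c(10, 5, 20)
)

# With a remote source, only the first step changes, e.g. (sketch):
# con     <- DBI::dbConnect(odbc::odbc(), ...)  # database connection
# flights <- tbl(con, "flights")                # lazy remote table

# The verbs themselves are unchanged
flights %>%
  group_by(carrier) %>%
  summarise(avg_delay = mean(dep_delay, na.rm = TRUE)) %>%
  arrange(desc(avg_delay))
```

With a remote backend, the pipeline stays lazy until you `collect()` the results back into R.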
Who should attend:
You should take this workshop if you want to learn how to work with big data in R. The data can be in memory, in a database (like SQL Server), or in a cluster (like Spark).
Hi! Is there anything we should do to prepare for this, other than having the suggested RStudio and R versions installed? Are there package prerequisites beyond dplyr?
Hi @skamanrev, thank you for your question. We plan to provide each student with a server in the cloud (AWS) that you will be able to access via a web browser on your machine. Please see this section of the GitHub repository for more info: https://github.com/rstudio-conf-2020/big-data#equipment
I'm trying to do the exercises provided in the Big Data repository, but I can't seem to find the datasets. Could you please upload the data or add a link to it?