Get in Touch

Course Outline

Introduction to Programming Big Data with R (bpdR)

  • Configuring your environment for pbdR
  • Overview of pbdR capabilities and tools
  • Commonly used packages alongside pbdR for Big Data

Message Passing Interface (MPI)

  • Utilizing pbdR MPI 5
  • Parallel processing techniques
  • Point-to-point communication
  • Transmitting matrices
  • Summing matrices
  • Collective communication methods
  • Summing matrices using Reduce
  • Scatter and Gather operations
  • Additional MPI communication patterns

Distributed Matrices

  • Constructing a distributed diagonal matrix
  • Computing the SVD of a distributed matrix
  • Building distributed matrices in parallel

Statistical Applications

  • Monte Carlo Integration
  • Loading datasets
  • Reading data across all processes
  • Broadcasting data from a single process
  • Accessing partitioned data
  • Distributed regression analysis
  • Distributed bootstrap methods
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories