Skip to content

alwaysalearner1234/parallel-computing-r

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Parallel Computing in R using Google Colab

📌 Overview

This project demonstrates the implementation of parallel computing techniques in R using Google Colab.
The objective is to improve computational efficiency by executing independent tasks simultaneously across multiple CPU cores.

Parallel computing is applied to common statistical and data science problems, making this project suitable for academic coursework, research, and real-world analytics.


🎯 Objectives

  • Understand the fundamentals of parallel computing in R
  • Implement task and data parallelism using R packages
  • Reduce execution time for computationally intensive tasks
  • Execute parallel R programs using Google Colab

🛠 Technologies Used

  • Programming Language: R
  • Platform: Google Colab (R Runtime)
  • Parallel Libraries:
    • parallel
    • doParallel
    • foreach
    • forecast
    • boot
    • tm

📂 Project Structure

parallel-computing-r-colab/ │ ├── parallel_computing_R.ipynb ├── README.md └── requirements.txt


🚀 Implemented Modules

1. Parallel Monte Carlo Simulation

  • Estimates the value of π (Pi)
  • Uses independent random sampling
  • Demonstrates task parallelism

2. Parallel Time Series Modeling

  • Fits ARIMA models on multiple time series datasets simultaneously
  • Reduces model training time

3. Parallel Bootstrap Resampling

  • Computes bootstrap confidence intervals
  • Efficient handling of large resampling iterations

4. Parallel Text Processing

  • Performs word frequency analysis
  • Processes multiple documents concurrently

🖥 How to Run the Project (Google Colab)

  1. Open Google Colab
  2. Go to Runtime → Change runtime type
  3. Select R as the runtime
  4. Upload parallel_computing_R.ipynb
  5. Run all cells sequentially

📊 Key Concepts Demonstrated

  • Master–Worker architecture
  • CPU core utilization
  • Task-level parallelism
  • Performance optimization
  • Scalable statistical computing

✅ Advantages of Parallel Computing

  • Faster execution time
  • Efficient resource utilization
  • Scalability for large datasets
  • Improved performance for simulations and modeling

🎓 Academic Relevance

  • Suitable for Parallel Computing / R Programming / Data Science courses
  • Can be used as:
    • Mini project
    • Lab record
    • Final year project reference
    • 16-mark university exam answer

💼 Resume Description

Implemented parallel computing techniques in R using Google Colab to optimize simulations, time-series modeling, bootstrap analysis, and text processing through efficient CPU core utilization.


📌 Future Enhancements

  • Performance comparison between sequential and parallel execution
  • Integration with large real-world datasets
  • Visualization of speedup and efficiency
  • Deployment on cloud-based R environments

📜 License

This project is created for educational and academic purposes.


👩‍🎓 Author

Duddekunta Lidiya Department of Computer Science / Data Science

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors