This course aims to introduce students to data analytics techniques using Python, with a focus on Exploratory Data Analysis (EDA), regression, and supervised learning. It equips learners with practical skills in handling data, automating EDA, and applying machine learning concepts in real-world scenarios.

sbccas/data-analytics-using-python


📘 Data Analytics Using Python

TYBCA – Course Code 602

A complete interactive learning repository for students of Data Analytics & Python


🌟 Overview

Welcome to the official repository for the TYBCA – Data Analytics Using Python course under VNSGU. This repository is designed to provide students with:

  • Interactive Google Colab notebooks
  • High-quality teaching materials
  • Practical assignments & lab exercises
  • Real-world datasets
  • Step-by-step EDA & Machine Learning basics
  • Student-friendly explanations + hands-on examples

This course emphasizes learning-by-doing, enabling students to explore data, visualize patterns, clean datasets, and understand foundational ML concepts.


🚀 Features of This Repository

  • ✔ Well-structured unit-wise content
  • ✔ Colab-ready notebooks with “Open in Colab” support
  • ✔ Beginner-friendly explanations & visualizations
  • ✔ Assignments + practice tasks for each unit
  • ✔ Real datasets for hands-on learning
  • ✔ Mini-project templates for student submissions
  • ✔ Vedic Mathematics Sutra implementations (Unit 4)
  • ✔ Continuously updated with new notebooks and improvements


📂 Repository Structure

data-analytics-using-python/
│
├── 1_Syllabus/
│   └── 602_Data_Analytics_using_Python.pdf         # official syllabus (uploaded)
│
├── 2_Lecture_Notes/
│   ├── Unit1_Fundamentals/
│   ├── Unit2_Automated_EDA/
│   ├── Unit3_Supervised_Learning/
│   └── Unit4_Vedic_Math_Sutras/
│
├── 3_Projects_Presentations/
│   ├── Mini_Project_Template.ipynb
│   ├── Student_Submissions/          # (one folder per student/group or zipped uploads)
│   └── Project_Evaluation_Rubric.md
│
├── 4_Assignments/
│   ├── Unit1_Assignment/
│   ├── Unit2_Assignment/
│   └── Unit3_Assignment/
│
├── 5_QuestionBank/
│   ├── Unit1_MCQ.md
│   ├── Unit1_Short_Long_Questions.md
│   └── Practical_Exam_Questions.md
│
├── 6_eBooks_ExtraResources/
│   ├── Reema_Thareja_Python_for_Data_Analysis.pdf   # if allowed by license / links
│   ├── References.md                                # canonical reading list + links
│   └── Tutorials/                                   # curated external links
│
├── 7_Previous_Year_Papers/
│
├── resources/
│   ├── datasets/
│   │   ├── students_performance.csv
│   │   ├── iris.csv
│   │   └── house_prices.csv
│   ├── notebooks/
│   │   ├── notebooks_list.md        # index of notebooks + "Open in Colab" links
│   │   ├── Unit1_Fundamentals.ipynb
│   │   ├── Unit1_Student_Workbook.ipynb
│   │   └── Unit2_Automated_EDA.ipynb
│   ├── assets/
│   │   ├── github_banner.png
│   │   └── logos/
│   └── data_dictionary.md
│
├── README.md
├── LICENSE
└── .gitignore

📘 Course Units

Unit 1 – Fundamentals of Data Analytics

  • EDA introduction
  • Types of analysis (Univariate, Bivariate, Multivariate)
  • Missing values, outliers
  • Normal & skewed distributions
  • Skewness & kurtosis

👉 Notebook: /notebooks/Unit1_Fundamentals.ipynb
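
The Unit 1 topics listed above (missing values, outliers, skewness, kurtosis) can be sketched in a few lines of standard-library Python. This is only an illustration, not the notebook's own code: the `raw_marks` data and the helper names `skewness`, `excess_kurtosis`, and `iqr_outliers` are hypothetical, and the formulas are the population versions, so libraries that apply sample corrections may report slightly different values.

```python
from statistics import mean, pstdev, quantiles


def skewness(values):
    """Population skewness: the mean of the cubed z-scores."""
    m, s = mean(values), pstdev(values)
    return sum(((x - m) / s) ** 3 for x in values) / len(values)


def excess_kurtosis(values):
    """Population excess kurtosis: mean of fourth-power z-scores minus 3."""
    m, s = mean(values), pstdev(values)
    return sum(((x - m) / s) ** 4 for x in values) / len(values) - 3


def iqr_outliers(values):
    """Return the values that fall outside the 1.5 * IQR fences."""
    q1, _, q3 = quantiles(values, n=4)
    spread = q3 - q1
    low, high = q1 - 1.5 * spread, q3 + 1.5 * spread
    return [x for x in values if x < low or x > high]


# Hypothetical marks: one missing entry (None) and one extreme value.
raw_marks = [52, 55, 58, 60, None, 61, 63, 65, 98]
marks = [x for x in raw_marks if x is not None]  # drop missing values

print("skewness:", round(skewness(marks), 3))
print("excess kurtosis:", round(excess_kurtosis(marks), 3))
print("outliers:", iqr_outliers(marks))
```

A positive skewness here reflects the long right tail created by the single high mark, which is also the value the IQR rule flags.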


Unit 2 – Automated EDA & Regression

  • Pandas & NumPy techniques
  • Automated EDA tools
  • Regression basics
  • Covariance & correlation
  • Machine Learning introduction

👉 Notebook: /notebooks/Unit2_Automated_EDA.ipynb
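
Covariance, correlation, and simple regression from the list above are closely related, which a short dependency-free sketch can make visible. The `hours` and `marks` values and the function names are hypothetical examples, not taken from the course notebooks, which may well use Pandas or NumPy instead.

```python
from statistics import mean


def covariance(xs, ys):
    """Sample covariance (divides by n - 1)."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)


def correlation(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    return covariance(xs, ys) / (covariance(xs, xs) * covariance(ys, ys)) ** 0.5


def fit_line(xs, ys):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    slope = covariance(xs, ys) / covariance(xs, xs)
    intercept = mean(ys) - slope * mean(xs)
    return slope, intercept


# Hypothetical data: study hours versus marks for five students.
hours = [1, 2, 3, 4, 5]
marks = [35, 45, 50, 62, 68]

slope, intercept = fit_line(hours, marks)
print("covariance:", covariance(hours, marks))
print("correlation:", round(correlation(hours, marks), 4))
print("fitted line: marks =", round(slope, 2), "* hours +", round(intercept, 2))
```

The regression slope is the covariance divided by the variance of the predictor, so all three quantities come from the same sums of deviations.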


Unit 3 – Supervised Learning

  • Classification vs Regression
  • Dataset splitting
  • Overfitting & Underfitting
  • Evaluation metrics: MSE, MAE, R²

👉 Notebook: /notebooks/Unit3_Supervised_Learning.ipynb
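
Dataset splitting and the three regression metrics named above can be written out by hand to show what they compute. This is a minimal sketch under assumptions: `split_rows` and the metric helpers are hypothetical names, and the sample values are made up. In practice a library such as scikit-learn provides equivalents.

```python
import random


def split_rows(rows, test_fraction=0.25, seed=42):
    """Shuffle a copy of rows and return (train, test) lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Hypothetical targets and predictions.
y_true = [3, 5, 7, 9]
y_pred = [2, 5, 8, 9]
print("MSE:", mse(y_true, y_pred))
print("MAE:", mae(y_true, y_pred))
print("R2:", r2(y_true, y_pred))

train, test = split_rows(list(range(8)))
print("train size:", len(train), "test size:", len(test))
```

Fixing the seed keeps the split reproducible, so metric values computed on the test portion can be compared between runs.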


Unit 4 – Vedic Mathematics Sutras

  • Logical reasoning with Vedic Math
  • 16 Sutras implemented in Python/C
  • Fast numeric techniques
  • Algorithmic thought development

👉 Notebook: /notebooks/Unit4_Vedic_Math_Sutras.ipynb
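
As an illustration of how a sutra can be turned into code, the sketch below implements two commonly taught techniques: squaring a number ending in 5 (usually attributed to Ekadhikena Purvena) and multiplying two numbers near a base (Nikhilam). Which sutras the Unit 4 notebook actually covers, and how, is not stated here, so the choice of techniques and the function names are assumptions.

```python
def square_ending_in_5(n):
    """Square a non-negative integer ending in 5.

    For n = 10a + 5, the square is a * (a + 1) followed by the digits 25.
    """
    if n < 0 or n % 10 != 5:
        raise ValueError("n must be a non-negative integer ending in 5")
    prefix = n // 10
    return prefix * (prefix + 1) * 100 + 25


def nikhilam_multiply(x, y, base=100):
    """Multiply two integers using their deviations from a common base.

    With x = base + dx and y = base + dy:
    x * y = (x + dy) * base + dx * dy
    """
    dx, dy = x - base, y - base
    return (x + dy) * base + dx * dy


print(square_ending_in_5(35))     # same value as 35 * 35
print(nikhilam_multiply(97, 96))  # same value as 97 * 96
```

Both functions are algebraic identities, so they can be checked against ordinary multiplication for any valid input.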


🔗 Open Notebooks in Google Colab

Every notebook in this repository is Colab-ready.

Use this badge template:

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/sbccas/data-analytics-using-python/blob/main/notebooks/<NOTEBOOK_NAME>.ipynb)
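
The badge line can also be generated rather than edited by hand for each notebook. This is a minimal sketch that assumes the repository path, branch, and `notebooks` folder shown in the template. The function name `colab_badge` and its parameters are hypothetical.

```python
BADGE_IMAGE = "https://colab.research.google.com/assets/colab-badge.svg"


def colab_badge(notebook_name,
                repo="sbccas/data-analytics-using-python",
                branch="main",
                folder="notebooks"):
    """Return the Markdown for an "Open In Colab" badge for one notebook."""
    target = (
        "https://colab.research.google.com/github/"
        f"{repo}/blob/{branch}/{folder}/{notebook_name}.ipynb"
    )
    return f"[![Open In Colab]({BADGE_IMAGE})]({target})"


# Print a badge line for a notebook named in this README.
print(colab_badge("Unit1_Fundamentals"))
```

If the notebooks live in a different folder, pass it through the `folder` argument instead of editing the URL string.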

🧠 Assignments, Labs & Projects

This repository includes:

  • 📝 Unit-wise Assignments
  • 🧪 Lab exercises
  • 📊 Practice datasets
  • 🚀 Mini-project templates
  • 🎯 Final capstone project outline

Students can open all tasks directly in Google Colab.


📊 Recommended Datasets (Included or Suggested)

  • StudentsPerformance dataset
  • Iris dataset
  • House Prices dataset
  • Small Retail Sales dataset
  • Attendance / Marks dataset

Datasets are located in resources/datasets/.


🀝 Contributing

Students and educators are welcome to contribute by:

  • Adding new datasets
  • Improving notebook content
  • Creating examples & visualizations
  • Submitting beginner-level ML notebooks
  • Reporting issues or suggesting improvements

Pull requests are encouraged!


👨‍🏫 Maintained By

Hitech Educator & IT Professional. Expert in Python, Data Analytics, C Programming, and .NET, with over two decades of teaching experience under VNSGU. Passionate about helping students learn through interactive examples and hands-on exploration.


⭐ Support & Feedback

If you find this repository useful:

  • ⭐ Star this repo
  • πŸ—£ Share with classmates
  • πŸ“ Open issues for feedback
  • 🀝 Contribute with notebooks/datasets

📒 License

This repository is intended for educational and academic use. All materials are freely available for students and faculty for learning purposes.

