Student Clustering System using Python + Machine Learning (on CGPA)
Student Clustering System
Project Summary
The Student Clustering System is a web app made using Python and Machine Learning. It groups students based on their CGPA and other number-based data. The app is built with Streamlit, which makes it easy to use.Users can upload a CSV file with student info, and the system will use KMeans clustering to group them. The results are shown in tables and charts, and you can also download the output directly. It’s a good project for learning how ML works with real data.
Download New Real Time Projects :-Click here
 Technologies Used
- Python
- Streamlit – Web Interface
- Pandas – Data Manipulation
- scikit-learn – For KMeans Clustering & Data Scaling
- Seaborn / Matplotlib – Data Visualizations
Project Flow: Step-by-Step
1. Upload CSV File
- User uploads a
.csv
file. - The system previews the data and filters only numeric columns for clustering.
2. Configure Clustering
- A slider allows the user to select the number of clusters (K).
- Data is standardized using StandardScaler.
- KMeans algorithm clusters the data.
- A new column
Cluster
is added to the dataset showing cluster labels.
3. Show Clustered Data
- The table with clusters is displayed.
- Users can download the clustered data as a CSV file.
4. Visualizations (Charts)
The system generates four types of visualizations to interpret the clusters:
- Bar Chart – Cluster Sizes: Shows the number of students in each cluster.
- Scatter Plot: Plots the first two numeric features, color-coded by cluster.
- Pie Chart: Represents the distribution of a selected numeric feature across clusters.
5. Cluster Info (Descriptions)
Interpretations based on CGPA:
- Cluster 0: Low CGPA (below 6.5)
- Cluster 1: Average CGPA (6.5 – 8.0)
- Cluster 2: High CGPA (above 8.0)
Key Features
- Upload any student-related CSV with numeric features
- Automatically selects numeric columns
- Â KMeans clustering with customizable number of clusters
- Â Four interactive charts to visualize results
- Â Downloadable clustered data
- Â Works with all types of academic datasets (not limited to CGPA)
k-means clustering python code GitHub machine learning projects with source code GitHub real-world machine learning projects GitHub machine learning projects github python machine learning projects with source code in python machine learning projects for final year with source code machine learning projects for final year GitHub advanced machine learning projects GitHub k-means clustering in machine learning machine learning projects in python with source code
types of clustering in machine learning clustering algorithms in machine learning hierarchical clustering in machine learning k-means clustering example machine learning projects using python GitHub clustering in machine learning examples
Post Comment