Building a linear regression model to predict a student's probability of admission to top US graduate schools — using GRE, TOEFL, CGPA, and research experience.
Jamboree Education helps students prepare for GRE and GMAT exams. They want to give students a data-driven estimate of their graduate school admission probability — based on their academic profile. A reliable predictor helps students set realistic targets and invest study time where it matters most.
| Feature | Correlation with Admission | Strength |
|---|---|---|
| CGPA | Very Strong | |
| GRE Score | Strong | |
| TOEFL Score | Strong | |
| University Rating | Moderate | |
| SOP | Moderate | |
| LOR | Moderate | |
| Research | Moderate-Weak |
from sklearn.linear_model import LinearRegression from sklearn.metrics import r2_score, mean_squared_error from sklearn.model_selection import train_test_split from sklearn.preprocessing import StandardScaler X = df.drop('Chance of Admit ', axis=1) y = df['Chance of Admit '] X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42) scaler = StandardScaler() X_train_s = scaler.fit_transform(X_train) X_test_s = scaler.transform(X_test) model = LinearRegression() model.fit(X_train_s, y_train) y_pred = model.predict(X_test_s) print("R²:", r2_score(y_test, y_pred)) # R² ≈ 0.82 — model explains 82% of admission variance