Link_Prediction_Project

This is a course project for MSBD 5008.

data_preprocessing.py generates the training and test sets for XGBoost.
It uses traditional link prediction methods to compute a score for each node pair, and those scores serve as the features during training. The methods currently come from networkit; we may replace them with our own implementations later.
classify.py runs prediction with all of those methods and saves the results in the result folder.
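
To illustrate what "a score for a node pair used as a feature" can look like, here is a minimal sketch in plain Python. It is not the project's code: the project relies on networkit, and this README does not say which methods are used, so the three indices below (common neighbours, Jaccard, Adamic-Adar) are an assumption, and the names build_adjacency and pair_features are hypothetical.

```python
import math


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbours)} from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def pair_features(adj, u, v):
    """Return [common neighbours, Jaccard, Adamic-Adar] for two distinct nodes u and v.

    The choice of indices is an assumption made for illustration only.
    """
    nu, nv = adj.get(u, set()), adj.get(v, set())
    common = nu & nv
    union = nu | nv
    jaccard = len(common) / len(union) if union else 0.0
    # For u != v, a common neighbour is adjacent to both, so its degree is >= 2
    # and log(degree) is strictly positive.
    adamic_adar = sum(1.0 / math.log(len(adj[w])) for w in common)
    return [float(len(common)), jaccard, adamic_adar]


# Toy graph, hypothetical data.
edges = [(0, 1), (0, 2), (1, 2), (1, 3), (2, 3), (3, 4)]
adj = build_adjacency(edges)
features = pair_features(adj, 0, 3)
```

Each node pair then becomes one row of the feature matrix, with a label saying whether the edge exists, which is the shape of input a classifier such as XGBoost expects.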

Things to do

Compare the results of the different methods
Friend recommendation with the XGBoost model
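
One possible way to approach the comparison item is to compute an AUC per method on the same labelled node pairs. The sketch below is only an assumption about how that could be done: the metric choice, the function name auc, and the method names and scores are all hypothetical and not taken from the project.

```python
def auc(scores, labels):
    """Pairwise AUC: probability that a positive pair outscores a negative pair.

    Ties count as half a win. Quadratic in the number of pairs, so this is
    only meant for small illustrative inputs.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical scores from two methods on the same four node pairs.
labels = [1, 1, 0, 0]
method_scores = {
    "method_a": [3.0, 2.0, 1.0, 0.0],
    "method_b": [1.0, 0.0, 1.0, 2.0],
}
results = {name: auc(s, labels) for name, s in method_scores.items()}
```

A higher value means the method ranks existing links above non-existing ones more often; 0.5 corresponds to random ranking.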

About LFS

Because GitHub limits the size of a single file to 50M, and our generated dataset exceeds that limit,
LFS is used for storing those large files, which costs me USD 5 per month (a meal).
Therefore I will keep those large files for only one month and remove them after this semester.
If you want to keep the data, remember to make a backup within this month.

kahungchong/Link_Prediction