#There are 6 mini Projects related to Data Analysis.
😊 For Project 1: Analysis of Daily Stock Returns and Exploration of Trading
Step1(Run Problem1)-Calculate Mean and Standard Deviation
Step2(Run Problem2)-Summarize Results Across Years
Step3(Run Problem3)-Compare Returns on Different Days
Step4(Run Problem4)-Analyze Patterns Across Days and Years
Step5(Run Problem5)-Compute Best and Worst Days to Invest
Step6(Run Problem6)-Evaluate Oracle Strategy and Buy-and-Hold
😊 For Project 2: Time series forecasting model
Step1(Run Problem1)-Create True Labels
Step2(Run Problem2)-Calculate Default Probability
Step3(Run Problem3)-Evaluate Consecutive Day Probabilities
Step4(Run Problem4)-Predict Labels
Step5(Run Problem5)-Ensemble Learning
😊 For Project 3:k-NN (k-nearest neighbors) and logistic regression
Step1(Run Problem1)-Data Preprocessing
Step2(Run Problem2)-Visual Analysis
Step3(Run Problem3)-Implement k-NN Classifier
Step4(Run Problem4)-Feature Selection with k-NN
Step5(Run Problem5)-Implement Logistic Regression Classifier
Step6(Run Problem6)-Feature Selection with Logistic Regression
😊 For Project 4: Analysis of Linear Models
Step1(Run Problem1)-Data Preprocessing
Step2(Run Problem2 and Problem3)-Model Comparison and Summarize Results
😊 For Project 5: Comparison of Classification Models
Step1(Run Problem1)-Data Preprocessing
Step2(Run Problem2)-Combine labels for normal and abnormal
Step3(Run Problem3)-Implement Naive Bayesian NB classifier
Step4(Run Problem4)-Implement Decision Tree classifier
Step5(Run Problem5)-Implement Random Forest classifier and find the best parameters
😊 For Project 6:Classification & Clustering Models with SVM, K-means
Step1(Run Problem1)-SVM Classification
Step2(Run Problem2)-Comparison with Other Classifiers
Step3(Run Problem3)-K-Means Clustering
Step4(Run Problem4)-Multi-Label Classifier with Largest Clusters
Step5(Run Problem5)-Evaluate Multi-Label Classifier