...
Methodologies Step by Step. If you just have to see one single thing. In the image, from the Greyed Boxes at the bottom, go back to top. That way the image will look pretty simple.
Presentation Slides: https://docs.google.com/presentation/d/13RELxQhucZH716WIfPDSKhuBJfJ07Cel5Cw5IS1aeUk/edit?usp=sharing
In-Progress: Questions to Answer for Data Science Project Development: http://bangla.salearningschool.com/recent-posts/can-you-answer-these-random-questions-on-data-science-project-development/
https://github.com/sayedum/acr-prediction-from-dietary-patterns-for-ckd-patients
You can just check the methodology diagram. The diagram though might look complex at first glance, there are actually only three different paths. I can add one more diagram to show that. https://github.com/sayedum/acr-prediction-from-dietary-patterns-for-ckd-patients/blob/master/final_MRP_poster_sayed_ahmed_2019.pdf
You can ignore this code for this session.https://github.com/sayedum/acr-prediction-from-dietary-patterns-for-ckd-patients/blob/master/Python-or-R-Code/food-subgroup-regression-prediction.ipynb
The most important part in the code to look is: When you will be described on 5 or 10 fold cross validations - that might seem complicated conceptually i.e. if you want to write all the code to divide data and keep track what to merge, what not to merge, what is the train dataset and what is the test dataset. In reality, in Python code this is very basic; you just have to use the right library function and right parameters.
Statistics for Big Data for Dummies: This book will help you a lot to develop your research methodologies for Data Science projects. Kind of must read.
Chapter 19: Design and Analysis of Machine Learning Experiments: Gives details on Research Methodologies for Data Science Projects. Not everyone will find the book to be an easy read. You do not have to read though this.https://www.academia.edu/28701585/Introduction_to_Machine_Learning_2e_Ethem_Alpaydin
The URL that will open: https://docs.google.com/spreadsheets/d/1m6yoKA-_BPxntKOKOHevnGPcpxoPecvP-4d3p3MFUiE/edit?usp=sharing
NHANES: Survey Data Files: CSV Format: https://drive.google.com/drive/folders/1fi_eQ9zWbi-RSegBuLSrpYLfaDML67pW?usp=sharing
Using Excel Data Analysis Plugin: https://docs.google.com/document/d/1vfkqgUZlc_TfbTg23TEKhIEriBDnoT7b7PRj5mL0fyU/edit?usp=sharing
For my project this was not important. Try to know what Factor Analysis is and How does it differ from PCA (Principal Component Analysis). https://drive.google.com/file/d/1fLJMCKVXCJUS1DXY-eDvg7ntJ1Wl26LZ/view?usp=sharing
The URL: https://drive.google.com/file/d/1-rwMcCsy2oKWWULEX9BFDf_gLhoyD-9l/view?usp=sharing
Execute the same work and steps as my current project did. However, combine data from 1996-2016 first and then apply on this big dataset. My current work was applied only on the data from 2015 - 2016.
Will share Github and Researchgate projects.
Association of Blood Pressure measures with Dietary Pattern. The NHANES survey also provided Blood Pressure measures. Hence, using the similar approach you will be able to associate the measures with dietary intake. Also, if enough correlation whether you can predict or not.
Using the similar approach as my project utilized, you might be able to associate how the assets inside an Investment Portfolio affecting the investment return. This eventually can help to create a great portfolio (sure that will need to integrate other components of Investing)
Quantitative, Graphical, Univariate, BiVariate, Regression
Sample Python code and plots
The content on medium might not be free. Hence, read on bangla.salearningschool.con or read details from the book Statistics for Big Data. http://bangla.salearningschool.com/recent-posts/category/statistics-for-big-data/
https://medium.com/@SayedAhmedCanada/important-basic-concepts-statistics-for-big-data-46b4e3f2177d
https://medium.com/@SayedAhmedCanada/best-practices-in-data-preparation-407ecefd58d1
Chat with the Participants