Resources

A list of resources that I have found usefull.

Following are the resources I personally found usefull from a perpective from learning data science. I will keep adding stuff I found usefull. All the resources are geared mostly towards to python, I will try and post R related material as well. Further, if you are new to coding and are not sure whether to use R or Python, I would suggest python as it is very simple to understand. One of the great resources from I which learned python from is Learn Python the Hard Way

Common Terms

  • Stochastic Process: Just another name for random processes.
  • Error due to Variance: If training a model on different sets of data leads to significance difference in the decision boundaries the model creates, then the model is suffering from error due to variance. It is observed usually when the model is fitted so closely to the training data that it cannot generalize enough.
  • Error due to Bias: This is error is introduced if the machine learning model has some incorrect pre conceived notion about how the predictor and output variables are related.

Online Statistics Courses


Topic Modelling - Text Analysis


Jupyter Notebooks

Data analysis with Python

Other awesome blogs

Other awesome stuff


Also regularly check out kaggle datasets, it contains usually small datasets which are ideal for getting your feet wet. More so over you get to read and learn from other people as well.