While cleaning up my GitHub repositories in May 2020, I found a collection of posts from my first blog site, which I didn't stick with. This article lists the references and tidbits I found useful early on.
List of links
- Gamedev Python quick reference http://www.cogsci.rpi.edu/~destem/gamedev/python.pdf
- List comprehension - https://datasciencelab.wordpress.com/2014/01/08/list-comprehension-in-python/
- Joshua Lande, Senior Data Scientist at Twitter - Learning MySQL - http://joshualande.com/archive/
- Using R in IPython - http://stats.stackexchange.com/questions/1595/python-as-a-statistics-workbench
- Using SQL within IPython - http://www.slideshare.net/depropiedades/ipython
- Cython https://www.kaggle.com/c/word2vec-nlp-tutorial/forums/t/11197/gensim-word2vec-cython-on-windows
- Data Science - Bayesian optimisation for validation parameters - http://jmhessel.github.io/Bayesian-Optimization/
- Multiple classifiers and classification calibration - http://blog.kaggle.com/2015/06/15/dont-miss-these-scripts-otto-group-product-classification/
- MOA - http://moa.cms.waikato.ac.nz/
- WEKA - http://markahall.blogspot.co.nz/?m=1
- Recurrent neural networks - Good for text analysis when there are more than 100k cases. Generates features. - https://clip.mn/video/yt-VINCQghQRuM
- Analysing data checklist - Good guide on methodology for approaching analysis - http://www.analyticsvidhya.com/blog/2015/02/data-exploration-preparation-model/
- Area under the curve - How to judge an ROC curve - http://gim.unmc.edu/dxtests/roc3.htm
- CSV fingerprint - Excellent tool for assessing CSV quality issues - http://setosa.io/csv-fingerprint/
- Permutations calculator - http://www.mathsisfun.com/combinatorics/combinations-permutations-calculator.html
- Tutorials for learning pandas https://bitbucket.org/hrojas/learn-pandas
- Hacking Excel - http://superuser.com/questions/807926/how-to-bypass-the-vba-project-password-from-excel
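
Two of the topics above, list comprehensions and area under the ROC curve, fit together in one small sketch. The code below is not taken from any of the linked pages. It is a minimal illustration that assumes binary labels (1 for positive, 0 for negative) and uses the standard interpretation of AUC as the probability that a randomly chosen positive case scores higher than a randomly chosen negative one, with ties counted as half. The function name `auc` and the sample data are made up for the example.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of scores."""
    # List comprehensions split the scores by class label.
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    # Each positive/negative pair counts 1 for a correct ordering,
    # 0.5 for a tie, and 0 otherwise.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical data: 3 positives, 3 negatives, one positive ranked too low.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
print(auc(labels, scores))
```

A value of 1.0 means every positive outranks every negative, while 0.5 is what random scoring would give. The pairwise loop is quadratic in the number of cases, so for anything large a library routine is the better choice.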