Data Science Command line Tools

Posted on Fri 23 August 2019 in Posts • Tagged with machine learning, linux • 5 min read

Finding the spy - post on Markov Chains and stochastic matrices

Posted on Sat 10 August 2019 in Posts • Tagged with python, jupyter, statistics, markow chains • 26 min read

Using puzzle on tracing the high profile spy as excuse to showcase Markov Chains and demonstrate usage and properties e.g. Stationary distribution

Top CLI tools that improve work in console in 2019

Posted on Sun 14 July 2019 in Posts • Tagged with zsh, scrapping, python • 2 min read

On Github project Awesome Zsh plugins you can find 1700+ links to plugins, themes and Zsh plugin managers/frameworks. The number of tools listed on that page is high and it is difficult to get orientation which plugins gained already good reputation from Zsh users community. This post aims at identifying most popular tools where popularity is measured with the number of stars that Github users added to given plugin or tool.

Learn Bayesian methods in 4 steps - by reading and by doing

Posted on Tue 09 July 2019 in Posts • Tagged with machine learning, statistics, probability • 3 min read

This post propose 4-steps path for learning Byesian methods. First step is goint through the book: "Bayesian methods for hackers", second, use complementary books for probability and statistics, third, read How to become a Bayesian in eight easy steps: An annotated reading list", and last, go throught the book full of exercises: "Think Bayes".

Kaggle evaluation metrics used for regression problems

Posted on Sat 16 February 2019 in Posts • Tagged with machine learning, evaluation, metrics, performance • 7 min read

This post describe evaluation metrics used in Kaggle competitions where problem to solve is has regression nature. Eight different metrics are described, namely: Absolute Error (AE), Mean Absolute Error (MAE), Weighted Mean Absolute Error (WMAE), Pearson Correlation Coefficient, Spearmanâ€™s Rank Correlation, Root Mean Squared Error (RMSE), Root Mean Squared Logarithmic Error (RMSLE), Mean Columnwise Root Mean Squared Error (MCRMSE).

How to install TensorFlow and Keras on Windows 10

Posted on Thu 17 January 2019 in Posts • Tagged with machine learning, tensorflow, howto • 2 min read

Guide on how to install TensorFlow cpu-only version - the case for machines without GPU supporting CUDA. Step-by-step procedure starting from creating conda environment till testing if TensorFlow and Keras Works.

Darwin Approach to Traveling Salesman

Posted on Sat 12 January 2019 in Posts • Tagged with machine learning, evolutionary • 6 min read

Can evolutionary approach crash the problem that brute forcing will last far more that the age of universe? This post shows how to attack Traveling Salesman Problem using Darwin approach. I'm describing evolution model and design decisions. See the animation how the population was evolving through the epochs.

How to organize Data Science project based on Jupyter notebook

Posted on Sat 05 January 2019 in Posts • Tagged with jupyter, python, notebook, howto, machine learning • 4 min read

Having several notebook-based projects behind you might result in mess in projects directory. Organize your Data Science project based on Jupyter notebooks in a way that one can navigate through it. Especially that "the one" will be most probably you in few months time. To achieve that: keep your projects directory clean, name the project in a descriptive way and take care of internal structure of the project.

What's cooking

Posted on Thu 05 April 2018 in Posts • Tagged with python, jupyter, kaggle, NLP, EDA, machine learning • 13 min read

Exploratory Data Anlysis of the Kaggle's "What's cooking" competition dataset to get understanding what kind of data we are dealing with and get intuition of existing dependecies.

Blockchain implementation

Posted on Mon 26 March 2018 in Posts • Tagged with blockchain, cryptocurency, python, jupyter • 12 min read

Python implementation of blockchain in few lines of code.