How to Detect ChatGPT-Generated Text?

2023-01-11

Discover the latest methods for distinguishing machine-generated text from the human-written text. Learn about statistical, syntactic, semantic, and neural network-based approaches. Stay up-to-date with the latest research in NLP and AI.


Continue reading

Maximizing Efficiency in MLOps - How Blue/Green Deployment Can Help?

2023-01-03

Learn about blue/green deployment in MLOps, its usefulness and when to use it, and the cost and complexity of maintaining two separate environments


Continue reading

Grammar of Graphics Explained Like to 5 Years Old Child.

2022-12-10

Learn the difference between imperative & grammar of graphics approach to plotting. Imprerative approach is simple and easy to understand, but grammar of graphics approach is more advanced and efficient. Examples with matplotlib and plotly express included.


Continue reading

MLOps Certifications - A Comprehensive Guide

2022-11-27

Learn about popular MLOps certifications offered by Amazon, Google, and Microsoft. Understand if they are worth the effort and money and discover free MLOps certifications available to gain knowledge and understanding of the field.


Continue reading

Roles in MLOps

2022-11-12

Learn about the different roles in MLOps and the responsibilities of each role, including Model Deployment Engineer, Data pipeline Engineer, Model Monitoring Engineer, Model Governance Engineer, Machine Learning Infra Engineer and Machine Learning Platform Engineer.


Continue reading

50 Questions for MLOps Engineer Job Interview

2022-11-02

Get ready for your next MLOps Engineer interview with our comprehensive list of 50+ questions. Covering topics like deployment, management, data pipeline, monitoring, and more.


Continue reading

MLOps Roles of the Future

2022-10-28

Discover the future of MLOps specializations, including Explainable AI/MLOps, Federated Learning/Edge MLOps, Reinforcement Learning/MLOps, AI/ML in IoT and IIoT, Model Explainability and Fairness.


Continue reading

Is MLOps a Good Career?

2022-10-22

Is a career in MLOps right for you? Learn about the pros and cons of this growing field, including high demand, high earning potential, exciting work, and career growth opportunities.


Continue reading

Visual Text Exploration as Part of Preprocessing Before Classification

2022-10-11

This post discusses importance of visual text exploration in preprocessing for classification, covers techniques (wordcloud, Sentiment Analysis, topic modeling, data cleaning) & how to use them with popular libraries. Encourages readers to try for own projects.


Continue reading

10 Lesser-Known, Yet Powerful Python Plotting Libraries

2022-09-30

The most widely used libraries for plotting in python are matplotlib, Plotly, seaborn, and bokeh. This article describes 10 other powerful plotting libraries available in Python that can be used to create high-quality plots and visualizations.


Continue reading

Automated Signal Segmentation, Trend Detection, and Classification

2022-09-12

This post presents the trend-classifier package that can be used for signal segmentation into parts where the trend is coherent.


Continue reading

The Best Self-Hosted, Open Source RSS Feed Readers in 2022

2022-08-31

Best self-hosted RSS Feed Readers selected by popularity and project activity.


Continue reading

How to Convert Pandas DataFrame to Markdown Table?

2022-07-05

  • pandas
  • table
  • markdown
  • conversion Category: note

up: MOC_Pandas

Generating an ASCII table from a pandas DataFrame can be accomplished in several ways …


Continue reading

Discovering Hidden Gems - Popular and Lesser-Known Dataset Sharing Platforms

2022-06-09

"Looking for the key to unlocking valuable datasets? Dive into the world of Kaggle, UCI, and more as we unveil the best platforms for data enthusiasts."


Continue reading

Lesser Known Backtesting Libraries

2022-06-05

This article presents a set of lesser-known but interesting libraries that can be used for backtesting trading strategies and trading algorithms in general.


Continue reading

Top 10 Python Libraries for Document Classification

2022-05-01

Unlock the power of document classification with these top Python libraries! Discover the best tools for effortless text analysis and more.


Continue reading

Tutorial on How to Create New Theme for Pelican Static Site Generator

2022-03-01

A great Pelican theme can make all the difference. Learn how to design your own with our comprehensive tutorial.


Continue reading

Pro Tips for Diagnosing Regression Model Errors

2022-02-22

Improve your regression model's accuracy and predictability by uncovering hidden errors with these essential plots.


Continue reading

15 Tools for Document Deskewing and Dewarping

2022-02-11

Sometimes input for document processing tasks such as OCR, table detection or text segmentation can be scanned or photo taken from hand that do not have ideal perspective - is rotated or spatially distorted in some way (warped document). If you are looking for my recommendations go straight to the last section of this article "Summary and recommendations".


Continue reading

How to Get Most of GitHub Copilot

2022-01-25

This post describes techniques that help to get most accurate suggestions from the GitHub Copilot "Your AI pair programmer". For those who never heard of Copilot there is short introduction, if you already know Copilot - you can jump directly to section 4 - "How to get most of GitHub Copilot".


Continue reading

Understanding Micro and Macro Averages in Multiclass Multilabel Problems

2021-12-22

Learn about micro and macro averages in multiclass multilabel problems, the difference between multiclass and multilabel problems and when to use micro and macro averages.


Continue reading

Setup VS Code as NIM IDE

2021-12-02

Learn how to set up VSCode as a Nim IDE from scratch. Includes instructions for downloading VSCode, installing the Nim extension, configuring settings, and debugging Nim code.


Continue reading

Top Popular ZSH Plugins on GitHub (2021)

2021-11-29

Explore the most popular Zsh plugins from the 1800+ options on the Awesome Zsh plugins GitHub project. See which ones have the highest number of stars from the Zsh community.


Continue reading

Unleashing the Power of T-Sne for Dimensionality Reduction in Python

2021-03-15

Want to create beautiful visualizations from complex data? Discover the power of T-SNE for dimensionality reduction in Python.


Continue reading

Kurtosis in Simple Terms, Interpretation and Gotchas

2021-02-18

Statistics can be tricky, but understanding kurtosis is a must for anyone who wants to avoid making common mistakes in statistical analyses. Learn how to interpret it in this comprehensive guide.


Continue reading

Finding Errors in Data - Data Validation

2021-01-31

Explore methods to detect & fix errors in data, including validation, visualizations, statistical tests, cleaning techniques, machine learning & data quality tools. Get concise, easy to understand information with examples & links to external resources.


Continue reading

Pandas Schema Validation

2021-01-16

Overview of the available tools and methods for schema validation in pandas, examplary code snippets and recommendation for when to use given tool.


Continue reading

Evaluation of Interpretability for Explainable AI

2020-11-05

Learn about the evaluation of interpretability in machine learning with this guide. Discover different levels and methods for assessing the explainability of models.


Continue reading

Metrics Used to Compare Histograms

2020-01-19

Learn about metrics used to compare histograms with examples of how to calculate them in python. From Chi-Squared distance to Kullback-Leibler divergence and Earth Mover's distance. A comprehensive guide.


Continue reading

Interactive plots for blogging

2019-09-13

Using Plotly, Bokeh and Altair for interactive visualizations in the blog posts.


Continue reading

Data Science Command-Line Tools

2019-08-23

Description of GNU utils and other less standard tools that helps with processing data from CLI or with shell scripts.


Continue reading

Finding the Spy - Post on Markov Chains and Stochastic Matrices

2019-08-10

Using puzzle on tracing the high profile spy as excuse to showcase Markov Chains and demonstrate usage and properties e.g. Stationary distribution


Continue reading

Top Popular ZSH Plugins on GitHub (2019)

2019-07-14

On the GitHub project Awesome Zsh plugins you can find 1700+ links to plugins, themes, and Zsh plugin managers/frameworks. The number of tools listed on that page is high and it is difficult to get orientation on which plugins gained already a good reputation from the Zsh users community. This post aims at identifying the most popular tools where popularity is measured by the number of stars that Github users added to a given plugin or tool.


Continue reading

Learn Bayesian Methods in 4 Steps - By Reading and by Doing

2019-07-09

This post proposes a 4-step path for learning Bayesian methods. The first step is going through the book "Bayesian methods for hackers", second, using complementary books for probability and statistics, the third, reading How to become a Bayesian in eight easy steps, and last, going through the book full of exercises - "Think Bayes".


Continue reading

Kaggle Evaluation Metrics Used for Regression Problems

2019-02-16

"This post describe evaluation metrics used in Kaggle competitions where problem to solve is has regression nature. Eight different metrics are described, namely - Absolute Error (AE), Mean Absolute Error (MAE), Weighted Mean Absolute Error (WMAE), Pearson Correlation Coefficient, Spearman\u2019s Rank Correlation, Root Mean Squared Error (RMSE), Root Mean Squared Logarithmic Error (RMSLE), Mean Columnwise Root Mean Squared Error (MCRMSE)."


Continue reading

How to Install TensorFlow and Keras on Windows 10

2019-01-17

Guide on how to install TensorFlow cpu-only version - the case for machines without GPU supporting CUDA. Step-by-step procedure starting from creating conda environment till testing if TensorFlow and Keras Works.


Continue reading

Darwin Approach to Traveling Salesman

2019-01-12

Can the evolutionary approach crash the problem that brute-forcing will last far more than the age of the universe? This post shows how to attack the Traveling Salesman Problem using Darwin's approach. I'm describing the evolution model and design decisions. See the animation of how the population was evolving through the epochs.


Continue reading

How to Organize Data Science Project Based on Jupyter Notebook

2019-01-05

Having several notebook-based projects behind you might result in a mess in the projects directory. Organize your Data Science project based on Jupyter notebooks in a way that one can navigate through it. Especially that "the one" will be most probably you in a few months time. To achieve that, keep your projects directory clean, name the project in a descriptive way and take care of the internal structure of the project.


Continue reading

What's Cooking

2018-04-05

Exploratory Data Analysis of the Kaggle's "What's cooking" competition dataset to get understanding what kind of data we are dealing with and get intuition of existing dependencies.


Continue reading

Blockchain Implementation

2018-03-26

Python implementation of blockchain in few lines of code.


Continue reading

Top Popular ZSH Plugins on GitHub

2018-03-22

There is an exhaustive but curated list of Zsh plugins posted on GitHub project Awesome Zsh plugins. You can find there 800+ links to plugins, themes and Zsh plugin managers/frameworks. Even though it is a collection of awesome stuff the number is a bit high get orientation which plugins gained already good reputation from Zsh users community. In this post I will identify most popular plugins - those which have the highest number of stars.


Continue reading