stats

Comparing string and md5 hash dissimilarity using hashlib and jellyfish

Demonstrating that the similarity of pairs of strings is uncorrelated with the Jaro and Damerau-Levenshtein similarity of their MD5 hashes.
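
As a rough illustration of the idea, the sketch below compares the similarity of a few string pairs with the similarity of their MD5 hex digests. It is not the post's code: to stay within the standard library, `difflib.SequenceMatcher` stands in for the jellyfish Jaro and Damerau-Levenshtein functions, and the helper names `md5_hex` and `similarity` and the example word pairs are made up for this sketch.

```python
import hashlib
from difflib import SequenceMatcher


def md5_hex(text: str) -> str:
    """Return the MD5 digest of text as a 32-character hex string."""
    return hashlib.md5(text.encode("utf-8")).hexdigest()


def similarity(a: str, b: str) -> float:
    """Similarity in [0, 1]; a stdlib stand-in for Jaro or Damerau-Levenshtein."""
    return SequenceMatcher(None, a, b).ratio()


# Hypothetical example pairs: identical, one letter apart, unrelated.
pairs = [("kitten", "kitten"), ("kitten", "sitten"), ("kitten", "elephant")]

for left, right in pairs:
    string_sim = similarity(left, right)
    hash_sim = similarity(md5_hex(left), md5_hex(right))
    print(f"{left!r} vs {right!r}: strings {string_sim:.2f}, hashes {hash_sim:.2f}")
```

Identical strings give identical hashes, but for any other pair the hash similarity should bear no relation to how close the original strings are.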

Truncated (by number of standard deviations) normal distribution in Python

A quick way to generate "normally" distributed data while excluding values more than a chosen number of standard deviations from the mean.
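
One simple way to do this, sketched here with the standard library only, is rejection sampling: draw from a normal distribution and discard anything outside the allowed range. The function name `truncated_normal`, its parameters, and the example values are assumptions for this sketch and may not match the approach taken in the post.

```python
import random


def truncated_normal(mean, sd, max_sd, n, rng=None):
    """Draw n normal values, rejecting any further than max_sd * sd from mean."""
    if sd <= 0 or max_sd <= 0:
        raise ValueError("sd and max_sd must be positive")
    rng = rng or random.Random()
    lower = mean - max_sd * sd
    upper = mean + max_sd * sd
    values = []
    while len(values) < n:
        x = rng.gauss(mean, sd)
        if lower <= x <= upper:  # keep only values inside the cut-off
            values.append(x)
    return values


# Hypothetical example: mean 100, sd 15, nothing beyond 2 standard deviations.
sample = truncated_normal(mean=100.0, sd=15.0, max_sd=2.0, n=1000,
                          rng=random.Random(42))
print(min(sample), max(sample))
```

Rejection sampling is easy to read but wasteful for tight cut-offs, since most draws are thrown away when `max_sd` is small.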

Python examples for visualising normal and skewed normal distributions with histograms

Some code snippets for generating and visualising normal and skewed data, and comparing their mean, quartiles and median.
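
The comparison of summary statistics can be sketched with the standard library alone. Here a log-normal sample stands in for the skewed data, which is an assumption of this sketch rather than necessarily the distribution used in the post; the helper name `summarise`, the seed, and the parameters are likewise illustrative. Plotting is left out to keep the example dependency-free.

```python
import random
import statistics

rng = random.Random(0)

# Symmetric data: standard normal. Skewed data: log-normal (right-skewed).
normal = [rng.gauss(0.0, 1.0) for _ in range(5000)]
skewed = [rng.lognormvariate(0.0, 0.75) for _ in range(5000)]


def summarise(data):
    """Return the mean, median and lower/upper quartiles of data."""
    q1, _, q3 = statistics.quantiles(data, n=4)
    return {
        "mean": statistics.fmean(data),
        "median": statistics.median(data),
        "q1": q1,
        "q3": q3,
    }


for name, data in (("normal", normal), ("skewed", skewed)):
    stats = summarise(data)
    print(name, {key: round(value, 3) for key, value in stats.items()})
```

For the symmetric sample the mean and median should nearly coincide, while the long right tail of the skewed sample pulls its mean above its median. If matplotlib is available, passing either list to `matplotlib.pyplot.hist` gives the corresponding histogram.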