A practical walkthrough of descriptive statistics in Python: measures of center, measures of spread, and percentiles, computed with pandas and NumPy on a real restaurant tipping dataset.
July 3, 2026 in Statistics, Python by Mehdi Lotfinejad · 9 minutes
pandas covers most data-cleaning jobs, but not all of them. This guide surveys three libraries worth keeping in your toolbox: numpy for vectorized numeric fixes, re for pattern-based text cleaning, and rapidfuzz for catching near-duplicate rows that exact matching can't see. Each is demonstrated on one small, reproducible dataset.
July 3, 2026 in Python, Data Analysis by Mehdi Lotfinejad · 12 minutes