Madalina CiortanJul 26, 20196 minOverview of feature selection methodsCommon strategies for choosing the most relevant features in your data set The importance of feature selection Selecting the right set of features to be used...

Madalina CiortanJun 22, 20194 minClustering data with graph oriented techniquesIntroduction to Louvain graph community detection This post will show an alternative approach to clustering a dataset, which relies on graph oriented techniq...

Madalina CiortanJun 8, 20194 minAn introduction to frequent pattern mining researchSummary of Apriori, Eclat and FP tree algorithms What are frequent patterns? Frequent patterns are collections of items which appear in a data set at an impo...

Madalina CiortanApr 7, 20194 minSubspace clusteringChallenges in high dimensional spaces This post addresses the following questions: 1. What are the challenges of working with high dimensional data? 2. What ...

Madalina CiortanMar 27, 20193 minGentle introduction to Echo State Networks This post will address the following questions: - What are Echo State Networks? - Why and when should you use an Echo State Network? - Simple implementation ...

Madalina CiortanMar 17, 20192 minBayesian multiple change point detectionRecursive estimation of change points Change points are abrupt changes in a sequence of observations which split the observed data in one (single change poin...

Madalina CiortanMar 13, 20192 minSimple example of 2D density plots in pythonHow to visualize joint distributions This post will show you how to: Use a Gaussian Kernel to estimate the PDF of 2 distributionsUse Matplotlib to represent ...

Madalina CiortanMar 10, 20198 minOverview of data distributionsHow to choose the right distribution to model your data There are over 20 different types of data distributions (applied to the continuous or the discrete sp...

Madalina CiortanMar 9, 20194 minExpectation maximizationFinding the Maximum Likelihood Estimate of a model depending on unobserved latent variables This article presents a basic python implementation of the expect...

Madalina CiortanDec 31, 20182 minSpectral clustering demystifiedLet’s start by introducing some basing graph theory notions. Adjacency matrix (A) Given a graph with n vertices and m nodes, the adjacency matrix is a square...

Madalina CiortanDec 2, 20181 minUnimodality tests and Kernel density estimationsWhen processing a large number of datasets which can potentially have different data distributions, we are confronted with the following considerations: - Is...