## Overview of feature selection methods

Madalina Ciortan

- Jul 26, 2019
- 6 min

Common strategies for choosing the most relevant features in your data set. The importance of feature selection: Selecting the right set of...

Madalina Ciortan

- Jun 22, 2019
- 4 min

Introduction to Louvain graph community detection. This post will show an alternative approach to clustering a dataset, which relies on...

Madalina Ciortan

- Jun 8, 2019
- 4 min

Summary of Apriori, Eclat and FP tree algorithms. What are frequent patterns? Frequent patterns are collections of items which appear in a...

Madalina Ciortan

- Apr 7, 2019
- 4 min

Challenges in high-dimensional spaces. This post addresses the following questions: 1. What are the challenges of working with high...

Madalina Ciortan

- Mar 27, 2019
- 3 min

This post will address the following questions: - What are Echo State Networks? - Why and when should you use an Echo State Network? -...

Madalina Ciortan

- Mar 17, 2019
- 2 min

Recursive estimation of change points. Change points are abrupt changes in a sequence of observations which split the observed data in one...

Madalina Ciortan

- Mar 13, 2019
- 2 min

How to visualize joint distributions. This post will show you how to: use a Gaussian kernel to estimate the PDF of 2 distributions; use...

Madalina Ciortan

- Mar 10, 2019
- 8 min

How to choose the right distribution to model your data. There are over 20 different types of data distributions (applied to the...

Madalina Ciortan

- Mar 9, 2019
- 4 min

Finding the Maximum Likelihood Estimate of a model depending on unobserved latent variables. This article presents a basic Python...

Madalina Ciortan

- Dec 31, 2018
- 2 min

Let’s start by introducing some basic graph theory notions. Adjacency matrix (A): Given a graph with n vertices and m edges, the...

Madalina Ciortan

- Dec 2, 2018
- 1 min

When processing a large number of datasets which can potentially have different data distributions, we are confronted with the following...
