What makes a Great Hacker(maker) ?

“Let me try to explain to you, what to my taste is characteristic for all intelligent thinking. It is, that one is willing to study in depth an aspect of one’s subject matter in isolation for the sake of

Getting started with Data Engineering in Spark

“Overall, machine learning systems can be thought of as a machine learning core — usually an advanced algorithm which requires a few chapters from Ian’s book to understand — surrounded by a huge amount of software engineering. The engineering can be

Collaborative Filtering Tutorial

Recommendation system is a specific type of information filtering technique that attempts to present information items (such as movies, music, web sites, news) that are likely of interest to the user. Traditionally, there are two methods to construct a recommender

Enterprise Information Retrieval using semantic search

Latent Semantic Analysis (LSA) is the most popular technique of Corpus-Based similarity. One way to tackle this problem is to use Latent Semantic Analysis[1]. The technique was proposed by Deerwester et. al and takes advantage of the implicit higher-order

Notes on Numpy Performance improvement

Disclaimer: These are complied notes from various sources. Apologies if some references are missing. When conducting experiments with Gensim(an awesome NLP library), i noticed i wasn't getting the same results as fast as the authors claimed. This