Clustering & Retrieval

Retrieval is the task of finding documents, people, or items of interest in a corpus. Clustering is the task of finding out which people or items form related groups. Use cases: If someone is listening to a song or watching a movie or TV show, she would be interested in similar songs, movies, or shows. If someone on an e-commerce website or app likes or purchases a product, she…
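To make the retrieval idea concrete, here is a minimal sketch of similarity-based retrieval. It assumes each item is already described by a small numeric feature vector and that cosine similarity is a reasonable measure of "similar"; the item names, vectors, and the helper `retrieve_similar` are hypothetical and only for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve_similar(query_id, items, k=2):
    """Return the ids of the k items most similar to items[query_id]."""
    query = items[query_id]
    scored = [
        (other_id, cosine_similarity(query, vec))
        for other_id, vec in items.items()
        if other_id != query_id  # never recommend the item itself
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [other_id for other_id, _ in scored[:k]]


# Hypothetical toy corpus: made-up feature vectors for four songs.
items = {
    "song_a": [1.0, 0.0, 0.0],
    "song_b": [0.9, 0.1, 0.0],
    "song_c": [0.0, 1.0, 0.0],
    "song_d": [0.0, 0.0, 1.0],
}

nearest = retrieve_similar("song_a", items, k=1)
print(nearest)
```

A real system would usually learn the feature vectors from data and use an index rather than a full scan; the brute-force loop here only shows the ranking step.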

Social Network Analysis – SNA

Social network analysis (SNA) is the mapping and measuring of relationships and flows between people, groups, organizations, computers, URLs, and other connected information/knowledge entities. The nodes in the network are the people and groups, while the links show relationships or flows between the nodes. [Ref] Relevant questions in social network analysis: What patterns are created by…
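The node-and-link view described above can be sketched in a few lines. This is only an illustration, assuming an undirected network given as a list of pairs; the names in `edges` and the helpers `build_graph` and `degree` are hypothetical. It counts how many links each node has, which is one of the simplest measures of how connected a person is.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected adjacency map: node -> set of neighbouring nodes."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def degree(graph):
    """Number of direct links for every node in the graph."""
    return {node: len(neighbours) for node, neighbours in graph.items()}


# Hypothetical relationships between four people.
edges = [
    ("alice", "bob"),
    ("alice", "carol"),
    ("bob", "carol"),
    ("carol", "dave"),
]

graph = build_graph(edges)
degrees = degree(graph)
most_connected = max(degrees, key=degrees.get)
print(degrees, most_connected)
```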

My Tools Box

Puppet: An open-source software configuration management tool. Puppet Tutorial for Beginners. Alternatives are SaltStack, Ansible, and Chef. Vagrant: Vagrant provides easy to configure, reproducible, and portable work environments built on top of industry-standard technology and controlled by a single consistent workflow to help maximize the productivity and flexibility of you and your team. To achieve its…

Tame the Anaconda (Python)

If you are planning to work in Python, you can download the Anaconda distribution of Python and install it on your system. conda is the main command you will use when working with Anaconda. You can create many virtual environments with different versions of Python and different sets of tools. Conda Commands Check the version of conda:


