Data Science Resources

Books (Machine Learning): An Introduction to Machine Learning with Python; Python Machine Learning – Sebastian Raschka; Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Google+, GitHub, and More; Logistic Regression: From Introductory to Advanced Concepts and Applications – Scott Menard; Introduction to Linear Regression Analysis (Wiley Series in Probability and Statistics); Introduction to Time…


Clustering & Retrieval

Retrieval is the task of finding documents, people, or items of interest in a corpus. Clustering is the task of finding groups of related people or items. Use cases: If someone is listening to a song or watching a movie or TV show, she would likely be interested in similar songs, movies, or shows. If someone on an e-commerce website/app likes or purchases a product, she…
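To make the retrieval use case concrete, here is a minimal sketch in Python, assuming each item is described by a small hand-made feature vector and that "similar" means highest cosine similarity. The item names, the feature values, and the helper `retrieve_similar` are all hypothetical and chosen only for illustration; a real system would derive features from actual content or user behaviour.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def retrieve_similar(query_id, corpus, k=2):
    """Return the ids of the k items most similar to the query item."""
    query = corpus[query_id]
    scored = [
        (cosine_similarity(query, vector), item_id)
        for item_id, vector in corpus.items()
        if item_id != query_id  # never recommend the item itself
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [item_id for _, item_id in scored[:k]]


# Hypothetical toy corpus: features are (action, romance, comedy) scores.
movies = {
    "action_movie_a": [0.9, 0.1, 0.0],
    "action_movie_b": [0.8, 0.2, 0.1],
    "romance_movie": [0.1, 0.9, 0.2],
    "comedy_show": [0.0, 0.2, 0.9],
}

similar = retrieve_similar("action_movie_a", movies, k=1)
print(similar)
```

With these made-up vectors, the viewer of `action_movie_a` is pointed to `action_movie_b`, the item whose feature vector points in nearly the same direction.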


Social Network Analysis – SNA

Social network analysis (SNA) is the mapping and measuring of relationships and flows between people, groups, organizations, computers, URLs, and other connected information/knowledge entities. The nodes in the network are the people and groups, while the links show relationships or flows between the nodes. [Ref] Relevant questions for social network analysis: What patterns are created by…
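The node-and-link picture above can be sketched in a few lines of Python. This is a minimal illustration, assuming an undirected network given as a list of pairs, and using degree centrality (the fraction of other nodes a node is directly linked to) as one simple measure of how connected a person is. The names and edges are hypothetical.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected adjacency map from (node, node) pairs."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def degree_centrality(graph):
    """Fraction of the other nodes that each node is directly linked to."""
    n = len(graph)
    if n <= 1:
        return {node: 0.0 for node in graph}
    return {node: len(neighbors) / (n - 1) for node, neighbors in graph.items()}


# Hypothetical relationships between four people.
edges = [
    ("alice", "bob"),
    ("alice", "carol"),
    ("alice", "dave"),
    ("bob", "carol"),
]

graph = build_graph(edges)
centrality = degree_centrality(graph)
most_connected = max(centrality, key=centrality.get)
print(most_connected, centrality[most_connected])
```

In this toy network `alice` is linked to every other person, so she has the highest degree centrality. Other measures, such as betweenness or closeness, answer different questions about the same graph.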


My Tools Box

Puppet: An open-source software configuration management tool. Puppet Tutorial for Beginners. Alternatives are SaltStack, Ansible, and Chef. Vagrant: Vagrant provides easy-to-configure, reproducible, and portable work environments built on top of industry-standard technology and controlled by a single consistent workflow to help maximize the productivity and flexibility of you and your team. To achieve its…

