Top 12 Essential Command Line Tools for Data Scientists

@machinelearnbot

This post is a short overview of a dozen command line tools for Unix-like operating systems that can be useful for data science tasks. The list does not include general file management commands (pwd, ls, mkdir, rm, ...) or remote session management tools (rsh, ssh, ...); instead, it is made up of utilities that are useful from a data science perspective, generally those related, to varying degrees, to data inspection and processing. All of them are included with a typical Unix-like operating system. The overview is admittedly elementary, and I encourage you to seek out additional command examples where appropriate. Tool names link to Wikipedia entries rather than man pages, as the former are generally friendlier to newcomers, in my view.
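To give a flavor of the kind of data inspection and processing meant here, the following is a minimal sketch rather than an excerpt from the list itself. It uses standard utilities (head, tail, wc, cut, sort, uniq, awk); whether each of these is among the dozen covered is an assumption. The file name sample.csv and its contents are hypothetical and created only for illustration.

```shell
# Create a small, hypothetical CSV file in a temporary directory.
tmpdir=$(mktemp -d)
csv="$tmpdir/sample.csv"
printf 'id,city,score\n1,Paris,7\n2,Lima,9\n3,Paris,4\n4,Oslo,8\n' > "$csv"

# Inspect: show the header and the first data row.
head -n 2 "$csv"

# Count data rows: skip the header line, then count the remaining lines.
# tr strips the padding some wc implementations add.
row_count=$(tail -n +2 "$csv" | wc -l | tr -d ' ')
echo "rows: $row_count"

# Process: tally the values in the second column, most frequent first.
city_counts=$(tail -n +2 "$csv" | cut -d, -f2 | sort | uniq -c | sort -rn)
echo "$city_counts"

# Extract the most frequent value from the tally.
top_city=$(echo "$city_counts" | head -n 1 | awk '{print $2}')
echo "most common city: $top_city"
```

Chaining small tools with pipes, as in the tally step, is the usual pattern: each utility does one job, and the combination answers a question about the data without loading it into a heavier environment.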