Processing a Trillion Rows Per Second on a Single Machine: How Can Nested Loop Joins be this Fast?