INTRO TO DATA SCIENCE: ARTIFICIAL INTELLIGENCE—AT THE INTERSECTION OF CS AND DATA SCIENCE
Artificial-Intelligence Milestones
- In a 1997 match between IBM’s DeepBlue computer system and chess Grandmaster Gary Kasparov, DeepBlue became the first computer to beat a reigning world chess champion under tournament conditions. IBM loaded DeepBlue with hundreds of thousands of grandmaster chess games. DeepBlue was capable of using brute force to evaluate up to 200 million moves per second! This is big data at work. IBM received the Carnegie Mellon University Fredkin Prize, which in 1980 offered $100,000 to the creators of the first computer to beat a world chess champion.
- https://en.wikipedia.org/wiki/Deep_Blue_versus_Garry_Kasparov.
- https://en.wikipedia.org/wiki/Deep_Blue_(chess_computer).
- In 2011, IBM’s Watson beat the two best human Jeopardy! players in a $1 million match. Watson simultaneously used hundreds of languageanalysis techniques to locate correct answers in 200 million pages of content (including all of Wikipedia) requiring four terabytes of storage. Watson was trained with machine learning and reinforcement-learning techniques. Chapter 13 discusses IBM Watson and Chapter 14 discusses machinelearning.
- Go—a board game created in China thousands of years ago —is widely considered to be one of the most complex games ever invented with 10 possible board configurations. To give you a sense of how large a number that is, it’s believed that there are (only) between 10 and 10 atoms in the known universe! In 2015, AlphaGo—created by Google’s DeepMind group—used deep learning with two neural networks to beat the European Go champion Fan Hui. Go is considered to be a far more complex game than chess. Chapter 15 discusses neural networks and deep learning.
- More recently, Google generalized its AlphaGo AI to create AlphaZero—a gameplaying AI that teaches itself to play other games. In December 2017, AlphaZero learned the rules of and taught itself to play chess in less than four hours using reinforcement learning. It then beat the world champion chess program, Stockfish 8, in a 100game match—winning or drawing every game. After training itself in Go for just eight hours, AlphaZero was able to play Go vs. its AlphaGo predecessor, winning 60 of 100 games.
A Personal Anecdote
Watson and Big Data Open New Possibilities
When Paul and I started working on this Python article, we were immediately drawn to IBM’s Watson using big data and artificialintelligence techniques like natural language processing (NLP) and machine learning to beat two of the world’s best human Jeopardy! players. We realized that Watson could probably handle problems like the sequence predictor because it was loaded with the world’s street maps and a whole lot more. That chet our appetite for digging in deep on big data and today’s artificialintelligence technologies, and helped shape Chapters 11–16 of this article. It’s notable that all of the datascience implementation case studies in Chapters 11–16 either are rooted in artificial intelligence technologies or discuss the big data hardware and software infrastructure that enables computer scientists and data scientists to implement leading-edge AI-based solutions effectively.
AI: A Field with Problems But No Solutions
For many decades, AI has been viewed as a field with problems but no solutions. That’s because once a particular problem is solved people say, “Well, that’s not intelligence, it’s just a computer program that tells the computer exactly what to do.” However, with machine learning (Chapter 14) and deep learning (Chapter 15) we’re not preprogramming solutions to specific problems. Instead, we’re letting our computers solve problems by learning from data—and, typically, lots of it. Many of the most interesting and challenging problems are being pursued with deep learning. Google alone has thousands of deeplearning projects underway and that number is growing quickly. As you work through this article, we’ll introduce you to many edge-of-the-practice artificial intelligence, big data and cloud technologies.