15. Deep Learning

One of AI’s most exciting areas is deep learning, a powerful subset of machine learning that has produced impressive results in computer vision and many other areas over the last few years. The availability of big data, significant processor power, faster Internet speeds and advancements in parallel computing hardware and software are making it possible for more organizations and individuals to pursue resource-intensive deep-learning solutions.

Keras and TensorFlow

In the previous chapter, Scikit-learn enabled you to define machine-learning models conveniently with one statement. Deep learning models require more sophisticated setups, typically connecting multiple objects, called layers. We’ll build our deep learning models with Keras, which offers a friendly interface to Google’s TensorFlow—the most widely used deep-learning library.¹ François Chollet of the Google Mind team developed Keras to make deep-learning capabilities more accessible. His book Deep Learning with Python is a must read.² Google has thousands of TensorFlow and Keras projects underway internally and that number is growing quickly.³,⁴

¹Keras also serves as a friendlier interface to Microsofts CNTK and the Université de Montréals Theano- (which ceased development in 2017). Other popular deep learning frameworks include Caffe (http://caffe.berkeleyvision.org/), Apache MXNet (https://mxnet.apache.org/) and PyTorch (https://pytorch.org/).

²Chollet, François. Deep Learning with Python. Shelter Island, NY: Manning Publications, 2018.

³http://theweek.com/speedreads/654463/google-more-than-1000-artificial-intelligence-projects-works.

⁴https://www.zdnet.com/article/google-says-exponential-growth-of-ai-is-changing-nature-of-compute/.

Models

Deep learning models are complex and require an extensive mathematical background to understand their inner workings. As we’ve done throughout the book, we’ll avoid heavy mathematics here, preferring English explanations.

Keras is to deep learning as Scikit-learn is to machine learning. Each encapsulates the sophisticated mathematics, so developers need only define, parameterize and manipulate objects. With Keras, you build your models from pre-existing components and quickly parameterize those components to your unique requirements. This is what we’ve been referring to as object-based programming throughout the book.

Experiment with Your Models

Machine learning and deep learning are empirical rather than theoretical fields. You’ll experiment with many models, tweaking them in various ways until you find the models that perform best for your applications. Keras facilitates such experimentation.

Dataset Sizes

Deep learning works well when you have lots of data, but it also can be effective for smaller datasets when combined with techniques like transfer learning^5,6 and data augmentation^7,8. Transfer learning uses existing knowledge from a previously trained model as the foundation for a new model. Data augmentation adds data to a dataset by deriving new data from existing data. For example, in an image dataset, you might rotate the images left and right so the model can learn about objects in different orientations. In general, though, the more data you have, the better you’ll be able to train a deep learning model.

⁵https://towardsdatascience.com/transfer-learning-from-pre-trained-models-f2393f124751.

⁶https://medium.com/nanonets/nanonets-how-to-use-deep-learning-when-you-have-limited-data-f68c0b512cab.

⁷https://towardsdatascience.com/data-augmentation-and-images-7aca9bd0dbe8.

⁸https://medium.com/nanonets/how-to-use-deep-learning-when-you-have-limited-data-part-2-data-augmentation-c26971dc8ced.

Processing Power

Deep learning can require significant processing power. Complex models trained on big-data datasets can take hours, days or even more to train. The models we present in this chapter can be trained in minutes to just less than an hour on computers with conventional CPUs. You’ll need only a reasonably current personal computer. We’ll discuss the special high-performance hardware called GPUs (Graphics Processing Units) and TPUs (Tensor Processing Units) developed by NVIDIA and Google to meet the extraordinary processing demands of edge-of-the-practice deep-learning applications.

Bundled Datasets

Keras comes packaged with some popular datasets. You’ll work with two of these datasets in the chapter’s examples. You can find many Keras studies online for each of these datasets, including ones that take different approaches.

In the “Machine Learning” chapter, you worked with Scikit-learn’s Digits dataset, which contained 1797 handwritten-digit images that were selected from the much larger MNIST dataset (60,000 training images and 10,000 test images).⁹ In this chapter you’ll work with the full MNIST dataset. You’ll build a Keras convolutional neural network (CNN or convnet) model that will achieve high performance recognizing digit images in the test set. Convnets are especially appropriate for computer vision tasks, such as recognizing handwritten digits and characters or recognizing objects (including faces) in images and videos. You’ll also work with a Keras recurrent neural network. In that example, you’ll perform sentiment analysis using the IMDb Movie reviews dataset, in which the reviews in the training and testing sets are labeled as positive or negative.

⁹The MNIST Database. MNIST Handwritten Digit Database, Yann LeCun, Corinna Cortes and Chris Burges. http://yann.lecun.com/exdb/mnist/.

Future of Deep Learning

Newer automated deep learning capabilities are making it even easier to build deep-learning solutions. These include Auto-Keras¹⁰ from Texas A&M University’s DATA Lab, Baidu’s EZDL¹¹ and Google’s AutoML¹².

¹⁰https://autokeras.com/.

¹¹https://ai.baidu.com/ezdl/.

¹²https://cloud.google.com/automl/.

15.1.1 Deep Learning Applications

Deep learning is being used in a wide range of applications, such as:

Game playing
Computer vision: Object recognition, pattern recognition, facial recognition
Self-driving cars
Robotics
Improving customer experiences
Chatbots
Diagnosing medical conditions
Google Search
Facial recognition
Automated image captioning and video closed captioning
Enhancing image resolution
Speech recognition
Language translation
Predicting election results
Predicting earthquakes and weather
Google Sunroof to determine whether you can put solar panels on your roof
Generative applications—Generating original images, processing existing images to look like a specified artist’s style, adding color to black-and-white images and video, creating music, creating text (books, poetry) and much more.

15.1.2 Deep Learning Demos

Check out these four deep-learning demos and search online for lots more, including practical applications like we mentioned in the preceding section:

DeepArt.io—Turn a photo into artwork by applying an art style to the photo. https://deepart.io/.
DeepWarp Demo—Analyzes a person’s photo and makes the person’s eyes move in different directions. https://sites.skoltech.ru/sites/compvision_wiki/static_pages/projects/deepwarp/.
Image-to-Image Demo—Translates a line drawing into a picture. https://affinelayer.com/pixsrv/.
Google Translate Mobile App (download from an app store to your smartphone)—Translate text in a photo to another language (e.g., take a photo of a sign or a restaurant menu in Spanish and translate the text to English).

15.1.3 Keras Resources

Here are some resources you might find valuable as you study deep learning:

To get your questions answered, go to the Keras team’s slack channel at https://kerasteam.slack.com.
For articles and tutorials, visit https://blog.keras.io.
The Keras documentation is at http://keras.io.
If you’re looking for term projects, directed study projects, capstone course projects or thesis topics, visit arXiv (pronounced “archive,” where the X represents the Greek letter “chi”) at https://arXiv.org. People post their research papers here in parallel with going through peer review for formal publication, hoping for fast feedback. So, this site gives you access to extremely current research.

15.2 Keras Built-In Datasets

Here are some of Keras’s datasets (from the module tensorflow.keras.datasets¹³) for practicing deep learning. We’ll use a couple of these in the chapter’s examples:

¹³In the standalone Keras library, the module names begin with keras rather than tensorflow.keras.

MNIST^¹⁴ database of handwritten digits—Used for classifying handwritten digit images, this dataset contains 28-by-28 grayscale digit images labeled as 0 through 9 with 60,000 images for training and 10,000 for testing. We use this dataset in Section 15.6, where we study convolutional neural networks.

¹⁴The MNIST Database. MNIST Handwritten Digit Database, Yann LeCun, Corinna Cortes and Chris Burges. http://yann.lecun.com/exdb/mnist/.
Fashion-MNIST^¹⁵ database of fashion articles—Used for classifying clothing images, this dataset contains 28-by-28 grayscale images of clothing labeled in 10 categories¹⁶ with 60,000 for training and 10,000 for testing. Once you build a model for use with MNIST, you can reuse that model with Fashion-MNIST by changing a few statements.
IMDb Movie reviews¹⁷—Used for sentiment analysis, this dataset contains reviews labeled as positive (1) or negative (0) sentiment with 25,000 reviews for training and 25,000 for testing. We use this dataset in Section 15.9, where we study recurrent neural networks.

¹⁵Han Xiao and Kashif Rasul and Roland Vollgraf, Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms, arXiv, cs.LG/1708.07747.

¹⁶https://keras.io/datasets/#fashion-mnist-database-of-fashion-articles.

¹⁷Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. (2011). Learning Word Vectors for Sentiment Analysis. The 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011).
CIFAR10¹⁸ small image classification—Used for small-image classification, this dataset contains 32-by-32 color images labeled in 10 categories with 50,000 images for training and 10,000 for testing.

¹⁸https://www.cs.toronto.edu/~kriz/cifar.html.
CIFAR100^¹⁹ small image classification—Also, used for small-image classification, this dataset contains 32-by-32 color images labeled in 100 categories with 50,000 images for training and 10,000 for testing.

¹⁹https://www.cs.toronto.edu/~kriz/cifar.html.

15.3 Custom Anaconda Environments

Before running this chapter’s examples, you’ll need to install the libraries we use. In this chapter’s examples, we’ll use the TensorFlow deep-learning library’s version of Keras.²⁰ At the time of this writing, TensorFlow does not yet support Python 3.7. So, you’ll need Python 3.6.x to execute this chapter’s examples. We’ll show you how to set up a custom environment for working with Keras and TensorFlow.

²⁰Theres also a standalone version that enables you to choose between TensorFlow, Microsofts CNTK or the Université de Montréals Theano (which ceased development in 2017).

Environments in Anaconda

The Anaconda Python distribution makes it easy to create custom environments. These are separate configurations in which you can install different libraries and different library versions. This can help with reproducibility if your code depends on specific Python or library versions.²¹

²¹In the next chapter, well introduce Docker as another reproducibility mechanism and as a convenient way to install complex environments for use on your local computer.

The default environment in Anaconda is called the base environment. This is created for you when you install Anaconda. All the Python libraries that come with Anaconda are installed into the base environment and, unless you specify otherwise, any additional libraries you install also are placed there. Custom environments give you control over the specific libraries you wish to install for your specific tasks.

Creating an Anaconda Environment

The conda create command creates an environment. Let’s create a TensorFlow environment and name it tf_env (you can name it whatever you like). Run the following command in your Terminal, shell or Anaconda Command Prompt:^22,23

²²Windows users should run the Anaconda Command Prompt as Administrator,

²³If you have a computer with an NVIDIA GPU thats compatible with TensorFlow, you can replace the tensorflow library with tensorflow-gpu to get better performance. For more information, see https://www.tensorflow.org/install/gpu. Some AMD GPUs also can be used with TensorFlow: http://timdettmers.com/2018/11/05/which-gpu-for-deep-learning/.

Table of Contents for 15. Deep Learning

Create new playlist

Sign In

Sign Up

15. Deep Learning

15.1 Introduction

Keras and TensorFlow

Models

Experiment with Your Models

Dataset Sizes

Processing Power

Bundled Datasets

Future of Deep Learning

15.1.1 Deep Learning Applications

15.1.2 Deep Learning Demos

15.1.3 Keras Resources

15.2 Keras Built-In Datasets

15.3 Custom Anaconda Environments

Environments in Anaconda

Creating an Anaconda Environment

Activating an Alternate Anaconda Environment

Deactivating an Alternate Anaconda Environment

Jupyter Notebooks and JupyterLab

15.4 Neural Networks

Artificial Neurons

Artificial Neural Network Diagram

Learning Is an Iterative Process

How Artificial Neurons Decide Whether to Activate Synapses

15.5 Tensors

High-Performance Processors

15.6 Convolutional Neural Networks for Vision; Multi-Classification with the MNIST Dataset

Reproducibility in Keras and Deep Learning

Basic Keras Neural Network

Launch JupyterLab

15.6.1 Loading the MNIST Dataset

15.6.2 Data Exploration

Visualizing Digits

15.6.3 Data Preparation

Reshaping the Image Data

Normalizing the Image Data

One-Hot Encoding: Converting the Labels From Integers to Categorical Data

15.6.4 Creating the Neural Network

Adding Layers to the Network

Convolution

Adding a Convolution Layer

Dimensionality of the First Convolution Layer’s Output

Overfitting

Adding a Pooling Layer

Adding Another Convolutional Layer and Pooling Layer

Flattening the Results

Adding a Dense Layer to Reduce the Number of Features

Adding Another Dense Layer to Produce the Final Output

Printing the Model’s Summary

Visualizing a Model’s Structure

Compiling the Model

15.6.5 Training and Evaluating the Model

Evaluating the Model

Making Predictions

Locating the Incorrect Predictions

Visualizing Incorrect Predictions

Displaying the Probabilities for Several Incorrect Predictions

15.6.6 Saving and Loading a Model

15.7 Visualizing Neural Network Training with TensorBoard

Executing TensorBoard

The TensorBoard Dashboard

Copy the MNIST Convnet’s Notebook

Configuring Keras to Write the TensorBoard Log Files

Updating Our Call to fit

15.8 ConvnetJS: Browser-Based Deep-Learning Training and Visualization

Training Stats

Instantiate a Network and Trainer

Network Visualization

Example Predictions on Test Set

15.9 Recurrent Neural Networks for Sequences; Sentiment Analysis with the IMDb Dataset

15.9.1 Loading the IMDb Movie Reviews Dataset

15.9.2 Data Exploration

Movie Review Encodings

Decoding a Movie Review

15.9.3 Data Preparation

Splitting the Test Data into Validation and Test Data

Table of Contents for
15. Deep Learning

Updating Our Call to `fit`