Python Programming Tutorials

Data from the Internet for Matplotlib

Aside from loading data from the files, another popular source for data is the internet. We can load data from the internet from a variety of ways, but, for us, we're going to just simply read the source code of the website, then use simple splitting to separate the data.

Update: Yahoo finance removed their API, so I have hosted an example that will still work. Feel free to adapt the code to another source if you know of one.

import matplotlib.pyplot as plt
import numpy as np
import urllib
import matplotlib.dates as mdates


def graph_data(stock):

    # Unfortunately, Yahoo's API is no longer available
    # feel free to adapt the code to another source, or use this drop-in replacement.
    stock_price_url = 'https://pythonprogramming.net/yahoo_finance_replacement'

    source_code = urllib.request.urlopen(stock_price_url).read().decode()

    stock_data = []
    split_source = source_code.split('\n')

    for line in split_source[1:]:
        split_line = line.split(',')
        if len(split_line) == 7:
            if 'values' not in line:
                stock_data.append(line)

There's a lot going on here. First, we see the imports. Pyplot is imported as usual, then numpy, then urllib for accessing the internet, and then matplotlib.dates as mdates, which is useful for converting date stamps to dates that matplotlib can understand.

Next, we begin to build our "graph_data" function. In here, we first define the url that contains the stock data. After that, we write some urllib code to access that url, then read, with .read, the source code, then we go ahead and decode the data. If you're following along with Python 2, it wont be necessary to use decode.

Then, we define an empty list, which is where we'll be placing the stock data shortly, and we also begin to split up the data with the split_source variable, splitting by new lines.

Now, if you go to look at the source code, replacing the "+stock+" in the url with a stock, like AAPL, you can see that most of the page data is indeed stock pricing information, but there is some header information there that we need to filter out. To do this, we'll just use some rudimentary filtration, checking to make sure there are 6 data points per line, and then making sure the term "values" isn't in the line.

Now, we have the data parsed out, and we're ready to munch on it. We're going to use NumPy for this:

    date, closep, highp, lowp, openp, adj_closep, volume = np.loadtxt(stock_data,
                                                          delimiter=',',
                                                          unpack=True,
                                                          # %Y = full year. 2015
                                                          # %y = partial year 15
                                                          # %m = number month
                                                          # %d = number day
                                                          # %H = hours
                                                          # %M = minutes
                                                          # %S = seconds
                                                          # 12-06-2014
                                                          # %m-%d-%Y
                                                          converters={0: bytespdate2num('%Y-%m-%d')})

What we are doing here, is unpacking these six elements to six variables, with numpy's loadtxt function. The first parameter here is stock_data, which is the data we're loading. Then, we specify the delimiter, which is a comma in this case, then we specify that we indeed want to unpack the variables here not just to one variable, but to this group of variables we've defined. Finally, we use the optional "converters" parameter to specify what element we want to convert (0), and then how we want to do that. We pass a function called bytespdate2num, which doesn't quite exist yet, but we'll write that next.

If you would like to know more about the syntax for date data, check out the posted video, around 9 minutes 35 seconds.

There exists 1 challenge(s) for this tutorial. for access to these, video downloads, and no ads.

The next tutorial:

Introduction to Matplotlib and basic line
Legends, Titles, and Labels with Matplotlib
Bar Charts and Histograms with Matplotlib
Scatter Plots with Matplotlib
Stack Plots with Matplotlib
Pie Charts with Matplotlib
Loading Data from Files for Matplotlib
Data from the Internet for Matplotlib
Converting date stamps for Matplotlib
Basic customization with Matplotlib
Unix Time with Matplotlib
Colors and Fills with Matplotlib
Spines and Horizontal Lines with Matplotlib
Candlestick OHLC graphs with Matplotlib
Styles with Matplotlib
Live Graphs with Matplotlib
Annotations and Text with Matplotlib
Annotating Last Price Stock Chart with Matplotlib
Subplots with Matplotlib
Implementing Subplots to our Chart with Matplotlib
More indicator data with Matplotlib
Custom fills, pruning, and cleaning with Matplotlib
Share X Axis, sharex, with Matplotlib
Multi Y Axis with twinx Matplotlib
Custom Legends with Matplotlib
Basemap Geographic Plotting with Matplotlib
Basemap Customization with Matplotlib
Plotting Coordinates in Basemap with Matplotlib
3D graphs with Matplotlib
3D Scatter Plot with Matplotlib
3D Bar Chart with Matplotlib
Conclusion with Matplotlib