Python Archives - wrighters.io

A Gentle Introduction to the Python Match Statement

When new features are added to Python, sometimes it can take a while to learn about and start using the feature. For me, the Python match statement (a.k.a. structural pattern matching) is a good example. Some features are very easy to grasp and use (for example, f-strings), but structural pattern matching is a bit more …

Options to run pandas DataFrame.apply in parallel

Python / By Matt Wright

A common task in pandas is applying a function to the rows of a DataFrame. A novice may be tempted to iterate through the rows of the DataFrame and pass each one to a function, but that is not a good idea. (You can read this article for a detailed …

Matching data between data sources with Python

Pandas, Python / By Matt Wright

Data is often messy and rarely in perfect shape. This is especially true if the data comes from many different sources and the specifications are loosely defined. If you have access to data that is in great shape, it’s probably because someone else did the dirty work of validating it, cleaning it up, and normalizing …

Finding and analyzing free stock index data with Python and EDGAR

Finance, Python / By Matt Wright

A stock index is just a list of stocks. But an index is a special list because investors use it to make investing decisions. An index is constructed via rules about which stocks to include, how much of each to include, and when to add or remove them. Finding this data, especially for more obscure indexes, can be …

Use pandas DateOffsets for easy date manipulation

Pandas, Python / By Matt Wright

So much useful data has a date or time component. Often, data has a timestamp to represent when the data was acquired, or when an event will take place, or as an identifying attribute like an expiration date. For this reason, understanding how to work with dates and times effectively can be a very useful …

Don’t append rows to a pandas DataFrame

Pandas, Python / By Matt Wright

Most pandas users encounter a situation where choosing to append rows to a pandas DataFrame seems like a good idea. A quick search of the API (or your favorite search engine) reveals that pandas has an append method in DataFrame. You may be tempted to use it. In this article I’ll show you why you …

Using multiple kernels in Jupyter

Python / By Matt Wright

If you’ve used a Jupyter notebook, you’ve used a kernel. A kernel is a process that executes code from a front-end process. Usually, if you are working in Python, the kernel you use is the IPython kernel. The IPython kernel usually matches the Python version and contains the same libraries as the process running the …

An introduction to accessing financial data in EDGAR, using Python

Finance, Python / By Matt Wright

Some sources of financial data can be expensive or difficult to find. For example, some data is available only from exchanges or vendors who charge a hefty fee for access. However, the financial industry is also heavily regulated, and one of its main regulators provides free access to its data. The U.S. Securities and Exchange Commission …

Passing date arguments to a Python script using argparse

Python / By Matt Wright

Argparse doesn’t support date arguments by default, but it can be easily extended to parse and validate dates in your Python scripts.
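
As a rough illustration of that idea (a minimal sketch, not necessarily the approach the full article takes), argparse accepts any callable as the `type` of an argument, so a small converter function can parse and validate a date. The names `valid_date`, `build_parser`, and `--start`, and the `YYYY-MM-DD` format, are assumptions made for this example.

```python
import argparse
from datetime import date, datetime


def valid_date(text: str) -> date:
    """Convert a YYYY-MM-DD string to a date (hypothetical helper for this sketch)."""
    try:
        return datetime.strptime(text, "%Y-%m-%d").date()
    except ValueError:
        # argparse turns this exception into a usage error message for the user
        raise argparse.ArgumentTypeError(
            f"not a valid date (expected YYYY-MM-DD): {text!r}"
        ) from None


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with one date-valued option (the option name is an assumption)."""
    parser = argparse.ArgumentParser(description="Example script that takes a date")
    parser.add_argument(
        "--start",
        type=valid_date,  # argparse calls this on the raw command-line string
        default=date.today(),
        help="start date in YYYY-MM-DD format (defaults to today)",
    )
    return parser


# Parse an explicit argument list here; a real script would call parse_args()
# with no arguments so that sys.argv is used instead.
args = build_parser().parse_args(["--start", "2021-03-15"])
print(args.start.isoformat())
```

With this in place, an impossible date such as `2021-02-30` is rejected by argparse with a usage error before the rest of the script runs.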

Using requests and BeautifulSoup in Python to scrape data

3 Comments / Python / By Matt Wright

Sometimes pandas.read_html doesn’t work for scraping website data, but you can try using requests and BeautifulSoup to do it yourself.