Code/data

Here you’ll find links to data, software packages, code, and other resources that I’ve created and written for research purposes. Most of the code is written in C, C++, or Python. What's listed here is just the tip of the iceberg, and I'm generally happy to share data and code, so if you come across something in one of my papers and would like to learn more, send me an email.

Software

cdindex

  • Python package for calculating the CD index and related measures
  • Includes random graph generators
  • Core functionality written in C for fast computation
  • Website
  • Python Package Index page
  • Github page

edgraph

  • C++ library for calculating the CD index and related measures
  • Uses C++ STL multimaps for fast computation
  • Github page

Teaching materials

Computational social science for organizational research

Big data tools for strategy research

  • 3 week course
  • PhD level
  • Covers Python, MySQL, and workflow
  • Github page

Introduction to social network analysis with Python

  • Originally developed for the National Science Foundation/National Center for Science and Engineering Statistics
  • Covers the basics of mathematical representation, networkx, visualization, data wrangling, and APIs
  • Uses data from the United States Patent and Trademark Office to map inventor networks
  • Github page

Introduction to social network analysis for the digital humanities

  • Originally developed for the Digital Arts, Sciences, and Humanities (DASH) program at the University of Minnesota
  • Covers the basics of mathematical representation, Python, networkx, visualization, data wrangling, and web scraping
  • Collects original data to conduct a network analysis of Shakespeare's tragedies and histories
  • Github page

Introduction to social network analysis with R

  • Originally developed for the University of Minnesota Strategy PhD program
  • Covers the basics of igraph, visualization, data wrangling, APIs, and spatial networks
  • Uses data from the United States Patent and Trademark Office to map inventor networks
  • Github page

Introduction to record linkage with Python

  • Originally developed for University of Michigan's Interdisciplinary Committee on Organizational Studies Big Data Bootcamp
  • Covers the basics of record linkage, fuzzy matching, deduplication, and web scraping
  • Uses data from the United States Department of Defense and OpenSecrets
  • Github page