Recent Posts

  1. Boston-bound for 2018

    Sun 10 September 2017

    Well, I never thought it would happen. Today I qualified for the Boston Marathon with a 3:00:04 showing at the Presque Isle Marathon.

  2. Enabling Jupyter notebook dashboards

    Thu 04 May 2017

    If you perform EDA using jupyter notebooks, it’s really easy to share those results with some moderate interaction via a jupyter dashboard. Here are the basic steps:

    1. Build the analysis, etc. Assuming this is done locally. Install the dashboard layout extension and lay out some sweet graphs. Optional: decorate …

  3. Should I set metadata manually in pyspark?

    Thu 04 May 2017

    Well, let’s do a simple test and find out if it speeds up the process of one-hot encoding a variable in our data. There are other reasons to set it, and we’ll get to those. Starting with the very helpful code snippet from spark-gotchas:

    import json
    
    from pyspark …
  4. Spring 2017 Berekely DataViz Project Roundup

    Fri 28 April 2017

    Some really fantastic work from the teams in W209 this semester. Links:

    • https://rileyrustad.github.io/W209_Final/?demographic=Custom
    • https://sharmila-velamur.github.io/trumpworld/home.html
    • https://ramseynoj.github.io/history_of_refugee_migration/metrocosm/imigration-paths.html
    • http://people.ischool.berkeley.edu/~yhzhao/food_access.html#dashboard_2
    • http://people.ischool.berkeley.edu/~rthamman/W209_Project_Final …
  5. Modular papers with LaTeX

    Thu 23 March 2017

    Earlier this week I wrote about how to structure an academic project directory and one of those directories was paper. The project write-ups (read: papers) will live here. My tool of choice for writing is LaTeX. It is not for everyone, and since you’re this far I’m going …

  6. Structuring academic project directories

    Tue 21 March 2017

    Organizing the basic structure of your computer is tremendously helpful in keeping track of things. You’re a good person, so I’ll start by assuming that you’re already using unix. Still, it’s easy to get overwhelmed when poking through old directories that full of folders named “attempt1 …

  7. Digital Humanities in 2017

    Wed 11 January 2017

    I received this message:

    Tyler Jan 9

    Hello Mr. Reagan,

    Recently I read your paper "The emotional arcs of stories are dominated by six basic shapes", and it fascinated me. I'd love to learn more about how data scientists are able to deconstruct and graph language; where can I find …

  8. The Shapes of Stories

    Mon 07 November 2016

    This is a repost of my original at our StoryLab blog_

    Stories help us encode and understand our collective existence, underpin cultures, and help frame the possible. Describing the ecology of all human stories is an essential scientific enterprise. With the advent of the internet and massive digitization this …

« Page 5 / 8 »