The bleeding edge of publishing: a quick biorxiv scrape

Posted on Mon 19 December 2016 in open science • Tagged with python, open science, visualizations, web scraping, preprints

Scraping publication amounts at biorxivΒΆ

Per a recent request somebody posted on Twitter, I thought it'd be fun to write a quick scraper for the biorxiv, an excellent new tool for posting pre-prints of articles before they're locked down with a publisher embargo.

A big benefit of open science is the ability to use modern technologies (like web scraping) to make new use of data that would originally be unavailable to the public. One simple example of this is information and metadata about published articles. While we're not going to dive too deeply here, maybe this will serve as inspiration for somebody else interested in scraping the web.


Continue reading

Visualizing publication bias: the case of funnel plots

Posted on Wed 30 November 2016 in open science • Tagged with python, funnel plots, open science, visualizations, simulations

This article is now interactive! Check out a live Binder instance here

In the next few months, I'll try to take some time to talk about the things I learn as I make my way through this literature. While it's easy to make one-off complaints to one another about how "science is broken" without really diving into the details, it's important learn about how
Continue reading


5 things I learned at SciPy

Posted on Tue 01 November 2016 in open science • Tagged with python, open science, programming, conferences

I've finally decompressed after my first go-around with Scipy. For those who haven't heard of this conference before, Scipy is an annual meeting where members of scientific community get together to discuss their love of Python, scientific programming, and open science. It spans both academics and people from industry, making it a unique place in terms of how software interfaces with scientific research. (if you're interested the full set of Scipy conferences, check out here
Continue reading