README.md
7d91adf2
 [![License](https://img.shields.io/badge/license-MIT_license-blue.svg)](http://git.dlma.com/techcrunch.git/blob/master/LICENSE.txt)
 ![python2.x](https://img.shields.io/badge/python-2.x-yellow.svg)
 # TechCrunch Feed Filter
 
 This is a Python script run as a cronjob to read the TechCrunch article feed, 
 and decide which articles to include in its own feed.
 
4dfd321c
 ## The Problem
 
 TechCrunch was a great blog about innovation and entrepreneurship. As it grew, 
 it published more articles than I cared to read. Like many savvy blog readers, 
a7d427d9
 I used a feed reader to present the latest articles to me, but TechCrunch's 
 feed was simply too profuse.
4dfd321c
 
 ## The Solution
 
 I created a service that'd visit TechCrunch's feed, and make note of who made 
 which articles, what the articles were about, how many comments each article 
a7d427d9
 had, and how many Diggs, Facebook likes and Facebook shares each article had.
4dfd321c
 
 With that data, the service would determine the median, mean, standard 
 deviation, and create a minimum threshold for whether the article merited 
 being seen by me.
 
06ebb890
 I go into deeper detail in this [blog post about it](https://david.dlma.com/blog/my-techcrunch-feed-filter).
4dfd321c
 
 Here's [the status page it generates](http://techcrunch.dlma.com/).
7d91adf2
 
297a7665
 # To Do
 
 * Maybe use Reddit upvotes
 
 # Pre-Git History
7d91adf2
 
 This was originally archived in a Subversion repo. I'd forgotten about the
 version control and had gotten into the habit of just modifying the production
 site.
 
 * 2010-09-03: Original
 * 2010-09-03: Save off the disqus identifier for use later.
 * 2011-02-04: Algorithm changes (tags and author checked), new chart drawing, spaces used instead of tabs.
 * 2011-02-04: Update to the chart drawing algorithm.
 * 2013-08-04: Miscellaneous changes to techcrunch.py
 * 2015-11-23: Resync svn with production site.
 * 2015-11-27: Remove obsolete disqus and retweet code, and refactor style to be more PEP-8ish.
 
 # Is it any good?
 
 [Yes](https://news.ycombinator.com/item?id=3067434).
 
 # Licence
 
06ebb890
 This software uses the [MIT license](http://git.dlma.com/techcrunch.git/blob/master/LICENSE.txt).