Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Bump the release to 0.40.4 | Matěj Cepl | 2014-01-04 | 2 | -1/+2 |
| | |||||
* | Update pip install in .travis.yml | Matěj Cepl | 2014-01-04 | 3 | -5/+7 |
| | | | | Supporting 2.6 would be too difficult ... patches welcome! | ||||
* | Make at least unittests running with Python 2.7 | Matěj Cepl | 2014-01-04 | 5 | -22/+49 |
| | | | | | Add support for Travis-CI. Fixes #283 | ||||
* | Fix setup.py | Matěj Cepl | 2014-01-03 | 4 | -2/+9 |
| | |||||
* | Create setup.py and add GPLv3 license. | Matěj Cepl | 2014-01-03 | 2 | -0/+71 |
| | | | | Fixes #282 | ||||
* | Improve README.rst | Matěj Cepl | 2014-01-03 | 1 | -4/+18 |
| | |||||
* | Skip over unfilled items in the unmangling config. file. | Matěj Cepl | 2014-01-03 | 1 | -0/+2 |
| | | | | Fixes #281 | ||||
* | Survive HTTPErrors on malformed URLs. | Matěj Cepl | 2014-01-03 | 1 | -10/+16 |
| | |||||
* | Ignore results of the work | Matěj Cepl | 2014-01-03 | 1 | -0/+2 |
| | |||||
* | Some logging to allow some following of the process. | Matěj Cepl | 2014-01-02 | 1 | -8/+12 |
| | | | | Fixes #280 | ||||
* | Unmangle mbox according to the configuration file. | Matěj Cepl | 2014-01-02 | 4 | -11/+327 |
| | | | | Fixes #273, #276 | ||||
* | For each group generate also a list of all mangled addresses. | Matěj Cepl | 2014-01-02 | 5 | -17/+63 |
| | | | | | | | | | | Google Groups (rightly) protects addresses against spammers. There is (obviously) no way to find the true values of these addresses programmatically, so we just generate a list of all affected ones, which could later be completed with the true values (collected somehow) and fixed by some other script. Fixes #275 | ||||
* | Creating raw MBOX fixed (tests included) | Matěj Cepl | 2014-01-02 | 5 | -34/+467 |
| | | | | Fix #278 and #271 | ||||
* | General structure of operation and MBOX writing. | Matěj Cepl | 2013-12-30 | 17 | -2762/+46 |
| | | | | So far, only unit test for the latter. | ||||
* | Collect raw article | Matěj Cepl | 2013-12-30 | 3 | -1/+125 |
| | |||||
* | Collect articles for one topic. | Matěj Cepl | 2013-12-29 | 5 | -20/+83 |
| | |||||
* | Convert to python3. | Matěj Cepl | 2013-12-28 | 4 | -44/+50 |
| | |||||
* | Collecting topics. | Matěj Cepl | 2013-12-28 | 18 | -7/+2893 |
| | | | | Added also some testing pages. | ||||
* | Add a note about the inspiration for the project. | Matěj Cepl | 2013-12-27 | 1 | -0/+3 |
| | |||||
* | Remove BE repository and move to Bugzilla | Matěj Cepl | 2013-12-27 | 9 | -303/+1 |
| | | | | https://luther.ceplovi.cz/bugzilla/buglist.cgi?quicksearch=product%3Agg_scrapper | ||||
* | Start of the project. | Matěj Cepl | 2013-11-22 | 12 | -0/+375 |
Based on the unescaping algorithm from https://developers.google.com/webmasters/ajax-crawling/docs/getting-started (of which I was reminded by Sean Hogan (http://www.meekostuff.net/)) |