Tree
- Tree:
0cf11e870092628497b8b71a17182f12e2a2ef16
- Date:
- Message:
- initial commit
README | commits | blame |
pom.xml | commits | blame |
src/ |
README
This simple project can be used to convert wikipedia dumps to plain text. usage: java -Xmx2G -Dfile.encoding=UTF-8 -jar wiki2text-1.0-jar-with-dependencies.jar nlwiki-20120203-pages-articles.xml.bz2 > nl.txt