Tree
- Tree:
9500180d4fcabd71f9c9206c49f7a3fbe8f3f895
- Date:
- Message:
- cleanup of libraries and dependencies
README | commits | blame |
pom.xml | commits | blame |
src/ |
README
This simple project can be used to convert wikipedia dumps to plain text. usage: java -Xmx2G -Dfile.encoding=UTF-8 -jar wiki2text-1.0-jar-with-dependencies.jar nlwiki-20120203-pages-articles.xml.bz2 > nl.txt