0562638ffa
* Use standard text normalization - fixes #2898 Fixes #2898. Fixes #409. Refs #650 Replaces homegrown ISO Latin-1 only character subsitition with standard Java Normalize to NFD, followed by diacritic removal and a few custom character expansions/replacements. * Fix Mac build * Improve compatibility with previous code One intentional change is folding O with stroke to oe instead of o. - Use more powerful NFKD instead of NFD - strip punctuation after decomposition since it can generate new punctuation - Add compatibility test for old asciify() method - Add some graphically similar characters to substitution table * Add oe character/ligature & more long S forms * More tests for ligatures and Latin Extended * Add Latin-1 Supplement tests |
||
---|---|---|
.github | ||
benchmark | ||
conf | ||
docs | ||
extensions | ||
graphics | ||
IDEs/eclipse | ||
licenses | ||
main | ||
packaging | ||
server | ||
.gitattributes | ||
.gitignore | ||
.travis.yml | ||
appveyor.yml | ||
AUTHORS.md | ||
CHANGES.txt | ||
CODE_OF_CONDUCT.md | ||
CONTRIBUTING.md | ||
GOVERNANCE.md | ||
lgtm.yml | ||
LICENSE.txt | ||
pom.xml | ||
README.md | ||
refine | ||
refine.bat | ||
refine.ini | ||
settings.xml | ||
start.sh | ||
stop.sh | ||
WeLoveOurBackers.md |
OpenRefine
OpenRefine is a Java-based power tool that allows you to load data, understand it, clean it up, reconcile it, and augment it with data coming from the web. All from a web browser and the comfort and privacy of your own computer.
Download
Run from source
If you have cloned this repository to your computer, you can run OpenRefine with:
./refine
on Mac OS and Linuxrefine.bat
on Windows
This requires JDK 8 and Apache Maven.
Documentation and Videos
Contributing to the project
Contact us
Licensing and legal issues
OpenRefine is open source software and is licensed under the BSD license
located in the LICENSE.txt. See the folder licenses
for information on open source
libraries that OpenRefine depends on.
Credits
This software was created by Metaweb Technologies, Inc. and originally written and conceived by David Huynh dfhuynh@google.com. Metaweb Technologies, Inc. was acquired by Google, Inc. in July 2010 and the product was renamed Google Refine. In October 2012, it was renamed OpenRefine as it transitioned to a community-supported product.
See AUTHORS.md for the list of OpenRefine contributors and CONTRIBUTING.md for instructions on how to contribute yourself.