Commit Graph

1734 Commits

Author SHA1 Message Date
David Huynh
f2ce1b7161 Fixed Issue 121: Importing attached file strips backslashes
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1369 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 03:35:42 +00:00
Stefano Mazzocchi
c976091624 new hooks to the Freebase Refinery
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1368 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 02:19:50 +00:00
David Huynh
823fe989a4 Fixed Issue 110: Import of single column text file with Postal Codes shows only 1 row with lots of � chars (?).
(by enforcing a confidence threshold on the encoding guessing)

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1367 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 00:26:53 +00:00
Stefano Mazzocchi
14d046bb7a silence velocity's logs
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1366 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 00:01:53 +00:00
Stefano Mazzocchi
c7d421969e ignore test output
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1365 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 23:56:26 +00:00
Stefano Mazzocchi
ad1db143c9 more info on how to run the tests in eclipse
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1364 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 23:45:25 +00:00
Stefano Mazzocchi
c44dd68f92 remove duplication
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1363 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 23:39:10 +00:00
Iain Sproat
c3c23a87b0 The renaming of TreeImporter to TreeImportUtilities didn't seem to get committed last time. Trying again.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1362 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 22:57:26 +00:00
Iain Sproat
d285999da8 New JsonImporter, JsonParser and JsonImporterTests (copy of XmlImporterTests with syntax of the example data altered for Json).
Renaming of TreeImporter to TreeImportUtilities (as per the current convention with the XmlImporter and XmlImportUtilities).

NB the new JsonParser class does not work, and 5 of the new unit tests for JsonImporter currently fail.  To be fixed in due course.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1361 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 22:53:17 +00:00
Stefano Mazzocchi
8cd6476fa4 remove empty dir
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1357 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 22:13:38 +00:00
Stefano Mazzocchi
565e3b6e90 adding default code style info to eclipse projects
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1356 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 22:11:38 +00:00
Stefano Mazzocchi
81c112e315 removing empty dirs
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1355 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 22:06:04 +00:00
Stefano Mazzocchi
86f810a324 hardening the timeline facet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1353 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 21:59:17 +00:00
Iain Sproat
e5ddfa6fdc All methods in XmlImportUtilities now use the TreeParser interface.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1323 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 17:59:53 +00:00
Iain Sproat
d71c563831 XmlImportUtilities.detectPathFromTag and XmlImportUtilities.detectRecordElement methods now use a generic TreeParser interface. A lightweight wrapper XmlParser wraps XMLStreamReader to provide parsing for xml data.
This is another small step towards a generic importer for tree structured data.  My plan is to refactor more of XmlImportUtilities' methods to use the TreeParser interface so that XmlStreamReader is no longer called directly from XmlImportUtilities.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1322 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 17:40:51 +00:00
Iain Sproat
1bda46d40f Methods which are generic to any tree structured data and don't rely on an XmlParser have been moved to a new TreeImporter class. This is a small step towards supporting importers for other tree structured data.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1321 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 16:09:44 +00:00
Iain Sproat
2df0d1ac3a Eclipse .launch file for tests now correctly points to project 'grefine' rather than incorrect 'refine'.
XmlImport tests now pass - the tests themselves were not testing the current xml being input to them.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1320 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 14:47:06 +00:00
Stefano Mazzocchi
6273332cef the sandbox->freebase loading conduit is now named "refinery"
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1313 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-25 01:57:56 +00:00
David Huynh
289d4e4c57 Ignore classes/ sub-dirs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1303 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:47:29 +00:00
David Huynh
64727ebd03 Somehow this fell out of a previous checkin.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1302 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:45:18 +00:00
David Huynh
8b115f67ff Stray renaming
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1301 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:42:08 +00:00
David Huynh
a112ffa9ab Caught a stray rename miss. Added more generic support for renaming old Java classes so that extensions could remain backward-compatible, too
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1297 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:59:57 +00:00
David Huynh
289f46bbf5 Removed yet another old empty Java package dir.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1296 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:36:34 +00:00
David Huynh
6ee94132f9 Removed another old empty Java package dir.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1295 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:34:03 +00:00
David Huynh
b45506b787 If refine data dir doesn't exist, try to find and move gridworks data dir over
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1294 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:30:14 +00:00
David Huynh
d15fe661fa Renamed in client-side code. Refine should start and existing projects should still work.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1293 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 22:55:28 +00:00
David Huynh
22cbcddd78 Renamed in build scripts. ./refine build, ./refine, and ./refine dist all work.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1292 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 21:47:35 +00:00
David Huynh
fe855c3aae Removed old empty Java packages
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1291 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 18:49:08 +00:00
David Huynh
1367ce301e More renaming, except for: client-side code, build scripts, anything to do with data loading and QA, workspace path. Refine can still run, and undo/redo on existing projects is working.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1290 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 18:36:33 +00:00
David Huynh
e6bc603a11 Renamed Java classes whose names contain 'Gridworks'. Refine is still able to start. But don't check out the code just yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1289 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 17:46:39 +00:00
David Huynh
edb23eb263 Changed Java packages com.google.gridworks.* to com.google.refine.* and modified other code just enough to start grefine up without error. Much remains to be done. Do not check out the code just yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1288 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 17:04:10 +00:00
David Huynh
a40a09b391 Renamed sub-projects as they are known to Eclipse
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1287 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 16:40:46 +00:00
David Huynh
65dbb5a2dc Removed RDF Exporter extension as its authors have agreed to host it themselves. For more information on that extension, see http://lab.linkeddata.deri.ie/2010/dcat/
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1286 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 15:49:45 +00:00
Stefano Mazzocchi
871bd356f6 swapping extensions
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1285 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-21 17:43:50 +00:00
Stefano Mazzocchi
06e83a66ff Issue-134: make sure the {build.dir} directory actually exist (it normally gets created by the invoking ./gridworks shell file, but only when calling a build, not a dist, so act more defensively here)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1275 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-19 17:08:39 +00:00
Stefano Mazzocchi
0a0add9e8e Issue-135: d'oh, stupid mistake
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1274 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-19 16:44:04 +00:00
David Huynh
362a277c58 Added main menu command to open system file explorer at the workspace directory.
Made project manager more careful at disposing projects, in case any of them is null.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1272 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 06:52:10 +00:00
David Huynh
2609c4049d Fixed issue 114: "Refactor project manager api to allow importers to create project metadata" by incorporating tfmorris' patch.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1271 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 01:00:23 +00:00
David Huynh
8d1f2d44b9 Patched the json lib to allow up to 100 levels of nesting.
Fixed ImportProjectCommand to redirect from the error page back to /index rather than /index.html.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1270 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 00:21:54 +00:00
Stefano Mazzocchi
eee4514643 fixing Issue-125
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1269 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 23:05:53 +00:00
Stefano Mazzocchi
9c5eda343a last of the bashisms (hopefully)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1268 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 22:00:54 +00:00
Stefano Mazzocchi
3667806e3a does not belong in version control
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1267 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 21:59:31 +00:00
Stefano Mazzocchi
81190b1e50 fixing bashisms
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1266 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 21:53:38 +00:00
Stefano Mazzocchi
221504e2da - removed all bash-isms (hopefully)
- hardened FULL_VERSION calculation (used only in the win32 exe, btw)
- made windows_dist run on all operating systems, not just mac

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1265 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 18:16:46 +00:00
Stefano Mazzocchi
df0a30e22d this is really a debug log
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1262 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-10 21:52:02 +00:00
David Huynh
9acd3dbe05 Fixed issue 127 - Add column from Freebase raises exception. Made sure DataExtensionChange saves properly.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1261 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-10 04:53:37 +00:00
Stefano Mazzocchi
e6d293bcc8 make it compile again
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1260 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-05 06:07:32 +00:00
Stefano Mazzocchi
4af3a4cbf5 allow gridworks to load extensions also from the same place where data is stored (in the 'extension' subfolder')
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1258 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-02 01:17:54 +00:00
Stefano Mazzocchi
089105a997 turns out people are having problems with this in Ubuntu... so let's avoid it for now
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1257 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-01 00:23:56 +00:00
Stefano Mazzocchi
e973fd3e89 d'oh, wrong object counter (thanks again to knut.forkalsrud for spotting my mistakes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1250 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 23:25:16 +00:00