Iain Sproat
d71c563831
The XmlImportUtilities.detectPathFromTag and XmlImportUtilities.detectRecordElement methods now use a generic TreeParser interface. A lightweight XmlParser wrapper around XMLStreamReader provides parsing for XML data.
...
This is another small step towards a generic importer for tree-structured data. My plan is to refactor more of XmlImportUtilities' methods to use the TreeParser interface so that XMLStreamReader is no longer called directly from XmlImportUtilities.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1322 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 17:40:51 +00:00
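A minimal sketch of the kind of wrapper this commit describes, assuming a pull-style interface. Only the names TreeParser, XmlParser and XMLStreamReader come from the commit message; the Token enum, the method signatures, and the firstEntityName helper are hypothetical and are not the project's actual code.

```java
import java.io.StringReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

final class TreeParserSketch {

    /** Hypothetical format-neutral token types (an assumption, not from the commit). */
    enum Token { START_ENTITY, END_ENTITY, VALUE, IGNORABLE }

    /** Hypothetical generic interface; the real TreeParser may look different. */
    interface TreeParser {
        boolean hasNext();
        Token next();
        String getLocalName();
    }

    /** Thin wrapper that hides XMLStreamReader behind the generic interface. */
    static final class XmlParser implements TreeParser {
        private final XMLStreamReader reader;

        XmlParser(String xml) {
            try {
                reader = XMLInputFactory.newInstance()
                        .createXMLStreamReader(new StringReader(xml));
            } catch (XMLStreamException e) {
                throw new IllegalArgumentException("Cannot open XML input", e);
            }
        }

        @Override
        public boolean hasNext() {
            try {
                return reader.hasNext();
            } catch (XMLStreamException e) {
                throw new IllegalStateException("Cannot read XML input", e);
            }
        }

        @Override
        public Token next() {
            try {
                // Translate StAX event codes into the format-neutral tokens.
                switch (reader.next()) {
                    case XMLStreamConstants.START_ELEMENT: return Token.START_ENTITY;
                    case XMLStreamConstants.END_ELEMENT:   return Token.END_ENTITY;
                    case XMLStreamConstants.CHARACTERS:    return Token.VALUE;
                    default:                               return Token.IGNORABLE;
                }
            } catch (XMLStreamException e) {
                throw new IllegalStateException("Cannot read XML input", e);
            }
        }

        @Override
        public String getLocalName() {
            // Only valid while positioned on a start or end element.
            return reader.getLocalName();
        }
    }

    /** Hypothetical caller that depends only on TreeParser, never on StAX. */
    static String firstEntityName(TreeParser parser) {
        while (parser.hasNext()) {
            if (parser.next() == Token.START_ENTITY) {
                return parser.getLocalName();
            }
        }
        return null;
    }
}
```

In this sketch, a caller such as firstEntityName never touches XMLStreamReader directly, so a parser for another tree-structured format could be substituted by implementing the same interface.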
Iain Sproat
1bda46d40f
Methods which are generic to any tree-structured data and don't rely on an XmlParser have been moved to a new TreeImporter class. This is a small step towards supporting importers for other tree-structured data.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1321 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 16:09:44 +00:00
Iain Sproat
2df0d1ac3a
Eclipse .launch file for tests now correctly points to project 'grefine' rather than incorrect 'refine'.
...
XmlImport tests now pass; the tests themselves were not testing the XML actually being input to them.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1320 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 14:47:06 +00:00
Stefano Mazzocchi
6273332cef
the sandbox->freebase loading conduit is now named "refinery"
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1313 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-25 01:57:56 +00:00
David Huynh
289d4e4c57
Ignore classes/ sub-dirs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1303 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:47:29 +00:00
David Huynh
64727ebd03
Somehow this fell out of a previous checkin.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1302 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:45:18 +00:00
David Huynh
8b115f67ff
Stray renaming
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1301 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:42:08 +00:00
David Huynh
a112ffa9ab
Caught a stray rename miss. Added more generic support for renaming old Java classes so that extensions could remain backward-compatible, too
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1297 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:59:57 +00:00
David Huynh
289f46bbf5
Removed yet another old empty Java package dir.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1296 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:36:34 +00:00
David Huynh
6ee94132f9
Removed another old empty Java package dir.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1295 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:34:03 +00:00
David Huynh
b45506b787
If refine data dir doesn't exist, try to find and move gridworks data dir over
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1294 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:30:14 +00:00
David Huynh
d15fe661fa
Renamed in client-side code. Refine should start and existing projects should still work.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1293 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 22:55:28 +00:00
David Huynh
22cbcddd78
Renamed in build scripts. ./refine build, ./refine, and ./refine dist all work.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1292 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 21:47:35 +00:00
David Huynh
fe855c3aae
Removed old empty Java packages
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1291 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 18:49:08 +00:00
David Huynh
1367ce301e
More renaming, except for: client-side code, build scripts, anything to do with data loading and QA, workspace path. Refine can still run, and undo/redo on existing projects is working.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1290 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 18:36:33 +00:00
David Huynh
e6bc603a11
Renamed Java classes whose names contain 'Gridworks'. Refine is still able to start. But don't check out the code just yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1289 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 17:46:39 +00:00
David Huynh
edb23eb263
Changed Java packages com.google.gridworks.* to com.google.refine.* and modified other code just enough to start grefine up without error. Much remains to be done. Do not check out the code just yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1288 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 17:04:10 +00:00
David Huynh
a40a09b391
Renamed sub-projects as they are known to Eclipse
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1287 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 16:40:46 +00:00
David Huynh
65dbb5a2dc
Removed RDF Exporter extension as its authors have agreed to host it themselves. For more information on that extension, see http://lab.linkeddata.deri.ie/2010/dcat/
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1286 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 15:49:45 +00:00
Stefano Mazzocchi
871bd356f6
swapping extensions
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1285 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-21 17:43:50 +00:00
Stefano Mazzocchi
06e83a66ff
Issue-134: make sure the {build.dir} directory actually exists (it normally gets created by the invoking ./gridworks shell file, but only when calling a build, not a dist, so act more defensively here)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1275 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-19 17:08:39 +00:00
Stefano Mazzocchi
0a0add9e8e
Issue-135: d'oh, stupid mistake
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1274 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-19 16:44:04 +00:00
David Huynh
362a277c58
Added main menu command to open system file explorer at the workspace directory.
...
Made project manager more careful at disposing projects, in case any of them is null.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1272 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 06:52:10 +00:00
David Huynh
2609c4049d
Fixed issue 114: "Refactor project manager api to allow importers to create project metadata" by incorporating tfmorris' patch.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1271 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 01:00:23 +00:00
David Huynh
8d1f2d44b9
Patched the json lib to allow up to 100 levels of nesting.
...
Fixed ImportProjectCommand to redirect from the error page back to /index rather than /index.html.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1270 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 00:21:54 +00:00
Stefano Mazzocchi
eee4514643
fixing Issue-125
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1269 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 23:05:53 +00:00
Stefano Mazzocchi
9c5eda343a
last of the bashisms (hopefully)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1268 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 22:00:54 +00:00
Stefano Mazzocchi
3667806e3a
does not belong in version control
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1267 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 21:59:31 +00:00
Stefano Mazzocchi
81190b1e50
fixing bashisms
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1266 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 21:53:38 +00:00
Stefano Mazzocchi
221504e2da
- removed all bash-isms (hopefully)
...
- hardened FULL_VERSION calculation (used only in the win32 exe, btw)
- made windows_dist run on all operating systems, not just mac
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1265 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 18:16:46 +00:00
Stefano Mazzocchi
df0a30e22d
this is really a debug log
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1262 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-10 21:52:02 +00:00
David Huynh
9acd3dbe05
Fixed issue 127 - Add column from Freebase raises exception. Made sure DataExtensionChange saves properly.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1261 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-10 04:53:37 +00:00
Stefano Mazzocchi
e6d293bcc8
make it compile again
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1260 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-05 06:07:32 +00:00
Stefano Mazzocchi
4af3a4cbf5
allow gridworks to load extensions also from the same place where data is stored (in the 'extension' subfolder)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1258 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-02 01:17:54 +00:00
Stefano Mazzocchi
089105a997
turns out people are having problems with this in Ubuntu... so let's avoid it for now
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1257 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-01 00:23:56 +00:00
Stefano Mazzocchi
e973fd3e89
d'oh, wrong object counter (thanks again to knut.forkalsrud for spotting my mistakes)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1250 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 23:25:16 +00:00
Stefano Mazzocchi
3bb0d3918c
adding the latest butterfly with the ability to avoid loading specific extensions (selected with strings or regexps on their names) via a property
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1247 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 23:08:47 +00:00
Stefano Mazzocchi
86ee119c44
more changes
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1246 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:47:06 +00:00
Stefano Mazzocchi
df19fc2065
Fixed ISSUE-29
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1245 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:47:01 +00:00
Stefano Mazzocchi
21ac857a50
more changes
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1244 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:33:10 +00:00
Stefano Mazzocchi
e5c6dda178
Fixed Issue-116
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1243 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:33:05 +00:00
Stefano Mazzocchi
7df259008b
more whitespace
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1242 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:32:59 +00:00
Stefano Mazzocchi
900b173d99
keep track of changes
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1241 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:32:54 +00:00
Stefano Mazzocchi
cf66d00854
only whitespace (no functional changes)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1240 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:32:48 +00:00
Stefano Mazzocchi
860d6c4ee2
a little more solid (it's possible to have both Dates and Calendars in there)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1239 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:12:11 +00:00
Stefano Mazzocchi
3648883e0c
ISSUE-99 thanks to knut.forkalsrud for providing the patch!
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1238 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 18:56:35 +00:00
Stefano Mazzocchi
8bff031c1c
increasing permgen and enabling garbage collection of classes in debug mode
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1235 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 17:58:15 +00:00
Stefano Mazzocchi
5d788c9260
added timeline facet (like the numeric binning facet but working on dates instead of numbers and with date-specific binning logic)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1234 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 17:57:54 +00:00
Stefano Mazzocchi
37099b4c9f
new butterfly version: mount extensions in their own subdirectory in the URL space
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1233 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 01:14:47 +00:00
Stefano Mazzocchi
6c75b8102e
clean up extensions too when cleaning up the code
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1232 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 01:14:37 +00:00