Stefano Mazzocchi
d3d40d608a
bunch of PMD-induced fixes
(now the PMD report is clean)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@430 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 00:14:11 +00:00
David Huynh
759824e1b4
Bug fix: editing one facet choice while some other choices are selected resulted in no change.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@429 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 22:16:12 +00:00
David Huynh
c4460fdfdd
Made history panel expand on click rather than on hover.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@428 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 21:32:36 +00:00
Stefano Mazzocchi
7526c4e582
cleanups (no functional changes)
this makes pmd and javac on linux happier
git-svn-id: http://google-refine.googlecode.com/svn/trunk@427 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:46:02 +00:00
Will Moffat
fff6eb0aa2
JSLint: legitimate comparisons to 0 and null should be made using ===
git-svn-id: http://google-refine.googlecode.com/svn/trunk@425 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:18:38 +00:00
Will Moffat
41a52f135d
JSLint: foo == 0, foo == null --> !foo
git-svn-id: http://google-refine.googlecode.com/svn/trunk@424 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:16:08 +00:00
Will Moffat
94aa2e0168
JSLint: semicolons
git-svn-id: http://google-refine.googlecode.com/svn/trunk@423 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 19:52:23 +00:00
Will Moffat
fbc1d04efb
JSLint fixes
git-svn-id: http://google-refine.googlecode.com/svn/trunk@421 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 18:21:28 +00:00
Stefano Mazzocchi
b0d23a4a10
another linux issue: openjdk6 comes in 'headless' mode as well, which throws if we attempt to contact the Desktop class, so make sure to warn the user nicely and avoid a nasty stacktrace printout
git-svn-id: http://google-refine.googlecode.com/svn/trunk@420 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 03:43:26 +00:00
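
Note on b0d23a4a10: a minimal sketch of the kind of guard the message describes, assuming the goal is simply to check for a headless JVM before touching java.awt.Desktop. The class name BrowserLauncher, the method openOrWarn, and the warning text are hypothetical and not taken from the Gridworks source.

```java
import java.awt.Desktop;
import java.awt.GraphicsEnvironment;
import java.net.URI;

class BrowserLauncher {
    /**
     * Tries to open the URI in the default browser.
     * Returns true if the browser was asked to open it, false if we only warned.
     */
    static boolean openOrWarn(URI uri) {
        // Check for headless mode first so that Desktop is never contacted there.
        if (GraphicsEnvironment.isHeadless() || !Desktop.isDesktopSupported()) {
            System.err.println("Cannot open a browser here; please point your browser at " + uri);
            return false;
        }
        try {
            Desktop desktop = Desktop.getDesktop();
            if (!desktop.isSupported(Desktop.Action.BROWSE)) {
                System.err.println("Browsing is not supported; please point your browser at " + uri);
                return false;
            }
            desktop.browse(uri);
            return true;
        } catch (Exception e) {
            // Warn with a one-line message instead of printing a stack trace.
            System.err.println("Could not open a browser (" + e.getMessage() + "); please open " + uri);
            return false;
        }
    }
}
```

Checking GraphicsEnvironment.isHeadless() before anything else keeps the Desktop class out of the picture on headless JVMs, which is where the message says the exception came from.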
David Huynh
bd24e5c7ff
Set ajax_in_progress attribute on <body> during ajax operations in order to facilitate testing.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@416 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-07 21:31:50 +00:00
Will Moffat
efb9a43b73
Added: toDate() now supports formats.
My dataset has the same dates written in two styles: Nov-09 and 11/09
You can now parse this using: value.toDate2('MM/yy','MMM-yy').toString('yyyy-MM')
git-svn-id: http://google-refine.googlecode.com/svn/trunk@414 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-07 19:12:08 +00:00
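
Note on efb9a43b73: the idea of trying several date formats in turn can be sketched in plain Java as below. This is an illustration under assumptions, not the GEL implementation: the class MultiFormatDates and its methods are hypothetical, and the sketch assumes English month names and non-lenient parsing.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;

class MultiFormatDates {
    /** Returns the date from the first pattern that parses the value, or null if none does. */
    static Date parseWithFormats(String value, String... patterns) {
        for (String pattern : patterns) {
            SimpleDateFormat format = new SimpleDateFormat(pattern, Locale.ENGLISH);
            format.setLenient(false);
            try {
                return format.parse(value);
            } catch (ParseException e) {
                // This pattern did not match; fall through to the next one.
            }
        }
        return null;
    }

    /** Parses with the given input patterns and re-renders with outPattern; null if unparseable. */
    static String reformat(String value, String outPattern, String... patterns) {
        Date date = parseWithFormats(value, patterns);
        return date == null ? null : new SimpleDateFormat(outPattern, Locale.ENGLISH).format(date);
    }
}
```

With the two styles from the message, reformat("Nov-09", "yyyy-MM", "MM/yy", "MMM-yy") and reformat("11/09", "yyyy-MM", "MM/yy", "MMM-yy") both normalize to the same year-month string.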
David Huynh
302c27687c
"type" in freebase suggest results got dropped so we need to fetch the result's types ourselves.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@409 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-07 18:23:07 +00:00
Stefano Mazzocchi
6dbe794658
enabled windmill-based UI testing (type ./gridworks test to try)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@406 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-07 08:28:53 +00:00
David Huynh
0996b9e1dd
Gzip project export tar files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@394 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 22:28:30 +00:00
David Huynh
5320cc6587
Make duplicated column names unique during import by appending indices to them.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@392 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 17:55:36 +00:00
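
Note on 5320cc6587: one way to make duplicated column names unique by appending indices is sketched below. The exact naming scheme used by the importer is not stated in the message, so the choice here (first occurrence unchanged, later ones get 2, 3, ... appended) is an assumption, as are the names ColumnNames and makeUnique.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

class ColumnNames {
    /** Returns the names in order, with an index appended to any name already seen. */
    static List<String> makeUnique(List<String> names) {
        Set<String> used = new HashSet<>();
        List<String> result = new ArrayList<>();
        for (String name : names) {
            String candidate = name;
            int index = 2;
            // Set.add returns false when the candidate is already taken.
            while (!used.add(candidate)) {
                candidate = name + index++;
            }
            result.add(candidate);
        }
        return result;
    }
}
```

Tracking every emitted name in the set also covers the case where an input column is already called something like "a2".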
David Huynh
9d9329ca96
Implemented row remove command.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@391 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 07:47:44 +00:00
David Huynh
1fd85c62bf
Implemented column rename command.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@390 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 07:15:34 +00:00
David Huynh
a28a8d1769
Fixed bug in collapse and expand all columns commands.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@389 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 06:18:43 +00:00
David Huynh
93d6f9fc54
Better error message for bad regular expressions in GEL.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@388 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 06:18:07 +00:00
David Huynh
f402db10af
Implemented inter-project joins.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@387 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 05:35:48 +00:00
Stefano Mazzocchi
e2d92aa0b1
d'oh, placed the join in the wrong spot
git-svn-id: http://google-refine.googlecode.com/svn/trunk@386 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 01:13:29 +00:00
Stefano Mazzocchi
771810bc0d
avoid exception if there is only one extension in the whole archive
git-svn-id: http://google-refine.googlecode.com/svn/trunk@385 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-05 21:36:27 +00:00
Stefano Mazzocchi
9dfdd1e351
make the thread join so that we can use ctrl-c to exit from the console
git-svn-id: http://google-refine.googlecode.com/svn/trunk@384 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-05 21:36:08 +00:00
Stefano Mazzocchi
2efbf0031f
- removed the 'thirdparty' directory (now the 'gridworks' script will download and install needed tools if they are not present in the system already)
- added 'findbugs' command that uses the findbugs static analyzer to look for problems in the code
- fixed a bunch of issues that findbugs found (a few methods will run a little faster, and a few NPEs will be avoided... nothing major but good to have)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@382 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-05 07:15:16 +00:00
Stefano Mazzocchi
798b2a36ca
- archive and compressed file importer (supports zip, tar, gz, bz2, tar.gz and tar.bz2)
(works by loading the files that have the most common extensions in the archive)
- changed default max heap to 3GB
git-svn-id: http://google-refine.googlecode.com/svn/trunk@381 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-04 07:48:47 +00:00
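
Note on 798b2a36ca: the "most common extension" selection can be sketched as a simple count over the archive's entry names. This is an illustration only: the message says "extensions" (plural), while this sketch keeps a single winner and breaks ties by first appearance, both of which are assumptions. ArchiveExtensions and its methods are hypothetical names.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

class ArchiveExtensions {
    /** Lower-cased extension of the last path segment, or "" if it has none. */
    static String extensionOf(String path) {
        String name = path.substring(path.lastIndexOf('/') + 1);
        int dot = name.lastIndexOf('.');
        return dot < 0 ? "" : name.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    /** Keeps only the entries whose extension occurs most often in the archive. */
    static List<String> withMostCommonExtension(List<String> paths) {
        // LinkedHashMap keeps insertion order, so ties go to the extension seen first.
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String path : paths) {
            counts.merge(extensionOf(path), 1, Integer::sum);
        }
        String best = null;
        int bestCount = 0;
        for (Map.Entry<String, Integer> entry : counts.entrySet()) {
            if (entry.getValue() > bestCount) {
                best = entry.getKey();
                bestCount = entry.getValue();
            }
        }
        List<String> selected = new ArrayList<>();
        for (String path : paths) {
            if (extensionOf(path).equals(best)) {
                selected.add(path);
            }
        }
        return selected;
    }
}
```

The sketch also handles an archive with a single extension or no entries at all without throwing.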
Stefano Mazzocchi
65c5aea079
set the internal version by hand, this is done to avoid NPE after reloads but also because it's only used to version the data dumps so the granularity of svn revisions was too high anyway
git-svn-id: http://google-refine.googlecode.com/svn/trunk@380 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-03 17:01:04 +00:00
Stefano Mazzocchi
c24ec94835
had to shuffle around a bunch of classes to separate the main server classloader from the context classloader and allow reloading to happen for real
git-svn-id: http://google-refine.googlecode.com/svn/trunk@377 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-03 01:01:11 +00:00
Stefano Mazzocchi
72203cd3d5
- moved all code that contained MIT IP outside ( http://code.google.com/p/simile-vicino/ )
- moved bzip2 and tar code from apache ant into their own jar files
- now gridworks source contains only com.metaweb.* code; everything else is a jar dependency
- started to work on archive importer
git-svn-id: http://google-refine.googlecode.com/svn/trunk@376 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 23:40:12 +00:00
Stefano Mazzocchi
4eda7ae2c0
avoid an array out of bounds exception in case there are no columns in the dataset
(I know, it should not happen but when it does let's not barf)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@375 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 20:21:41 +00:00
Stefano Mazzocchi
62f5f21ca3
atom is handled as well by the XML importer
git-svn-id: http://google-refine.googlecode.com/svn/trunk@374 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 06:44:05 +00:00
Stefano Mazzocchi
83faee3aa9
add a frame-less menu item in macosx to be able to open another gridworks browser window/tab in case we closed it by mistake
(no idea how to do this on windows, though, since there is no frame-less menu concept there)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@373 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 06:39:41 +00:00
Stefano Mazzocchi
521acda025
- pass the svn revision as format version (for more detailed verification)
- add an 'autoreload' setting that makes Gridworks autoreload itself if a class gets changed
(this is useful to make development cycles faster when working on the java code with an autocompiling IDE such as Eclipse or IDEA)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@372 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 00:52:38 +00:00
Stefano Mazzocchi
d1e72e7797
make the undo dialog closable
git-svn-id: http://google-refine.googlecode.com/svn/trunk@371 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 19:11:37 +00:00
Stefano Mazzocchi
988378c761
Hmm, String.split() bites us again: use the commons-lang one instead to avoid having to escape regexp values (this was preventing a user from splitting by "." in GEL)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@370 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 17:49:31 +00:00
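
Note on 988378c761: the pitfall is that java.lang.String.split() treats its argument as a regular expression, so "." matches every character. The sketch below shows the problem and one literal-split alternative. It uses Pattern.quote from the JDK as a stand-in to stay self-contained; the commit itself switched to the commons-lang split, which is not reproduced here. LiteralSplit is a hypothetical name.

```java
import java.util.regex.Pattern;

class LiteralSplit {
    /** Plain String.split: the separator is interpreted as a regular expression. */
    static String[] regexSplit(String s, String separator) {
        return s.split(separator);
    }

    /** Quotes the separator first, so it is matched literally. */
    static String[] literalSplit(String s, String separator) {
        return s.split(Pattern.quote(separator));
    }
}
```

regexSplit("a.b.c", ".") yields an empty array, because every character matches and the resulting empty strings are dropped, while literalSplit("a.b.c", ".") yields the three parts a, b, and c.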
Stefano Mazzocchi
0e07ec7acc
crude, I know, but for now make Gridworks digest RDF/XML as if it were XML (works surprisingly well, btw)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@369 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 16:56:38 +00:00
Stefano Mazzocchi
dced641599
- added the ability to specify the character separator for CSV or TSV files that don't use commas or tabs (this was needed to parse a dataset that we got from the BBC to try things out)
- used commons-lang split function instead of the java String.split one, this is necessary to avoid having to escape separators that might be confused for regexps
git-svn-id: http://google-refine.googlecode.com/svn/trunk@368 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 22:34:21 +00:00
Stefano Mazzocchi
77b452e87f
adding version information to the about page
NOTE: this shows up only in the packaged distribution
git-svn-id: http://google-refine.googlecode.com/svn/trunk@367 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 19:20:32 +00:00
Stefano Mazzocchi
3c9af6501e
more consistent naming and various polishing
git-svn-id: http://google-refine.googlecode.com/svn/trunk@364 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 01:07:52 +00:00
Stefano Mazzocchi
5884d257db
default to 'relevance' (which is faster) instead of recon. Change to recon if the user suggests schema hooks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@363 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 01:07:17 +00:00
Stefano Mazzocchi
571f2c9ab3
- better README
- made the build system obtain and use svn revision info directly in version.js
- fixed launch4j initial memory usage
- added .ini support for .exe starting in windows
- more robust up-to-date logic that uses SVN revisions instead of dates
- connected to new freebase.com/labs/gridworks web site
git-svn-id: http://google-refine.googlecode.com/svn/trunk@362 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 00:17:40 +00:00
Stefano Mazzocchi
f0fec31d98
moving the gridworks site over to freebase.com/labs for now (which is in acre)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@358 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-30 21:23:00 +00:00
Stefano Mazzocchi
7c132cfa53
clean eclipse warnings
git-svn-id: http://google-refine.googlecode.com/svn/trunk@357 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-29 20:51:18 +00:00
David Huynh
1d0e6abaf8
Got some work done on the plane:
- better detection of record XML elements in XML importer
- XML importer creates column groups and data table view renders them
git-svn-id: http://google-refine.googlecode.com/svn/trunk@356 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-27 05:23:09 +00:00
David Huynh
2a9fbd7d81
Made sure columns are named hierarchically in XML importer.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@355 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 23:15:10 +00:00
David Huynh
df7389876f
First shot at XML import.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@354 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 23:08:08 +00:00
David Huynh
4e76155652
Use application/x-unknown when exporting TSV so that the browser just saves the result. This is good for large exports, which overload the browser if the browser tries to display them.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@353 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 17:51:27 +00:00
David Huynh
47cad64a3f
Properly unescape \t, \r, \n, \\.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@352 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 17:50:44 +00:00
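
Note on 47cad64a3f: unescaping the four sequences named in the message can be sketched as a single left-to-right pass. Where in Gridworks this unescaping happens is not stated, so this is a generic illustration; the class Unescape is hypothetical, and leaving unknown escapes untouched is an assumption.

```java
class Unescape {
    /** Converts \t, \r, \n and \\ into the characters they stand for; other escapes are left as-is. */
    static String unescape(String s) {
        StringBuilder sb = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c == '\\' && i + 1 < s.length()) {
                char next = s.charAt(++i); // consume the character after the backslash
                switch (next) {
                    case 't': sb.append('\t'); break;
                    case 'r': sb.append('\r'); break;
                    case 'n': sb.append('\n'); break;
                    case '\\': sb.append('\\'); break;
                    default: sb.append(c).append(next); // unknown escape: keep both characters
                }
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }
}
```

Consuming the escaped character in the same step is what keeps an escaped backslash followed by "n" from being misread as a newline, which is the usual bug with chained replace calls.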
David Huynh
30e3ca4965
Added splitByLengths function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@351 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 04:30:31 +00:00
David Huynh
4df1c4107a
Fixed a bug introduced recently: recon candidates were not serializing their topic types for the data view, so in the data view we can't send back a candidate's types when the user wants to match the candidate to some cells. I need to figure out a better way to optimize this.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@350 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 03:58:52 +00:00
David Huynh
00cce1b99a
Styling tweaks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@349 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 21:24:44 +00:00