Commit Graph

3572 Commits

Author SHA1 Message Date
Stefano Mazzocchi
2691ee50d7 adding OS-specific data paths
git-svn-id: http://google-refine.googlecode.com/svn/trunk@173 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 02:53:07 +00:00
Stefano Mazzocchi
52eef30177 adding government contracts from http://it.usaspending.gov/
git-svn-id: http://google-refine.googlecode.com/svn/trunk@172 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 00:48:16 +00:00
David Huynh
ad7671508f Added "cancel processes" command, not hooked up yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@171 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 00:30:39 +00:00
David Huynh
2741d54984 Added custom suggest widget for properties.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@170 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 23:48:21 +00:00
David Huynh
59c5314e42 Fixed bug in list facet: list facets on columns with numeric data weren't working before.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@169 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 23:07:33 +00:00
David Huynh
b488d093c8 Added indicator for facet refreshing, and control for forcing facets to refresh.
Made facets drag-and-drop-able.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@168 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 22:32:52 +00:00
David Huynh
3ecfb4e4d9 Implemented facet-based edit operation for real.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@167 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 20:33:11 +00:00
David Huynh
d9507200f8 Started work on the facet-based edit dialog, which allows editing of cells based on their projections to facet choices.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@166 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 19:58:47 +00:00
David Huynh
512cd16381 Implemented recon by keys, guids, and ids.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@165 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 18:19:20 +00:00
David Huynh
99ae6109d8 Started work on key-based recon.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@164 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 03:31:58 +00:00
David Huynh
e57aae888b Hooked up the recon service at data.labs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@163 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 00:33:32 +00:00
David Huynh
f16727c20c Refactored recon code on the server side to prepare for supporting other modes of recon.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@162 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 23:33:23 +00:00
David Huynh
a17882c44f Redesigned the recon dialog. New features are not implemented yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@161 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 21:38:50 +00:00
Stefano Mazzocchi
621655372f - save encoding and confidence in the project metadata
- use the saved encoding for decoding
- don't error when fingerprinting null


git-svn-id: http://google-refine.googlecode.com/svn/trunk@160 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 04:56:16 +00:00
David Huynh
bc9bc54d30 Implemented a meta parser that looks for a language prefix and picks the right parser.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@159 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 01:50:56 +00:00
David Huynh
acfa19a683 Moved GEL stuff (gridworks expression language) into gel package.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@158 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 01:30:31 +00:00
David Huynh
84269e7e83 gel package for gridworks' native expression language stuff. This is to prepare for multiple language support.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@157 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 01:15:46 +00:00
David Huynh
7c38fbb945 Created an ast package for gridworks expression language abstract syntax tree nodes. Moved parsing exception class out to its own file.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@156 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 01:14:21 +00:00
David Huynh
983be19e14 Made EvalError serializable because errors can be cell values and need to be saved.
Turned is* functions into controls, since they have to be able to test errors, and only controls can do that, not functions.
Polished display of errors in cells and in expression preview dialog.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@155 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 00:21:13 +00:00
Stefano Mazzocchi
0c6590fe2c - added an encoding guesser
- fixed a bunch of encoding issues
- added a function to reinterpret call content in another encoding
- added a 'phonetic' function to the expression language that supports metaphone and soundex
- updated the COS library to the latest released version 
- added the IBM ICU4j library (that contains the encoding guesser)
- added examples with same content but different encodings


git-svn-id: http://google-refine.googlecode.com/svn/trunk@154 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-28 21:51:33 +00:00
Stefano Mazzocchi
d9e67ac806 - diff now can act before two dates (still to be fully tested)
- added string fingerprinting function (useful for clustering)
- fixed unicode() function which wasn't returning correct values
- added a toString method to EvalError to know what error that was
- fixed a NPE in TextTransformationOperation


git-svn-id: http://google-refine.googlecode.com/svn/trunk@153 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-28 00:55:09 +00:00
Stefano Mazzocchi
f1923758e7 - add a bunch of new functions
- very lax date parser
 - lots of new advanced string functions
- new version of commons-lang


git-svn-id: http://google-refine.googlecode.com/svn/trunk@152 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 08:56:04 +00:00
David Huynh
25fd5794cd Added choices blank and error to list facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@151 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 06:59:55 +00:00
David Huynh
49e7241d1d Re-organized functions into a few sub-packages.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@150 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 05:59:09 +00:00
David Huynh
5f4dc16d20 Some new packages for re-organizing functions.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@149 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 05:51:20 +00:00
David Huynh
c914aa6c16 Introduced EvalError objects as possible values returned by expressions.
Extracted function and control name mappings to ControlFunctionRegistry.


git-svn-id: http://google-refine.googlecode.com/svn/trunk@148 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 05:48:33 +00:00
David Huynh
f0b8268809 Implemented permanent link, at least for facets' states.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@147 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 00:16:44 +00:00
David Huynh
f5ff9044cf Track and display recon stats in column headers.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@146 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 23:33:16 +00:00
David Huynh
aadf98a388 Show choice count in list facet. Use close.png for close button.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@145 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 22:24:25 +00:00
David Huynh
c601bb6b0a Re-organized .js files a bit.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@144 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 22:07:54 +00:00
David Huynh
9f7a86fb4d Folders for re-organizing .js files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@143 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 22:03:24 +00:00
David Huynh
30dce3b3d5 Made range facet more robust against bad expressions.
Centralized code that updates components of the UI. Show "Working..." indicator if anything takes more than 500ms.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@142 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 21:56:41 +00:00
David Huynh
1e4b9f4e80 Fixed bug in text search facet where if the query is null or empty string it'd filter to nothing.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@141 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 18:44:36 +00:00
Stefano Mazzocchi
7152c4aae2 make sure to set JAVA_HOME when it's not set and we already know where it is
git-svn-id: http://google-refine.googlecode.com/svn/trunk@139 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-25 00:07:18 +00:00
Stefano Mazzocchi
2e5267c088 allowing to use letters in windows versions
git-svn-id: http://google-refine.googlecode.com/svn/trunk@138 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-24 23:52:20 +00:00
David Huynh
e2ccd491c5 New dmg background.
Added download site html.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@137 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-24 23:28:24 +00:00
David Huynh
c81913c6f8 Make sure process widget doesn't get stuck.
Enforce maximum height on process widget.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@136 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-24 19:48:11 +00:00
David Huynh
bb83dcda1c Added support for specifying number of initial rows to skip when creating a new project.
Fixed the height of the histogram images in range facets to eliminate jitters.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@135 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-24 18:52:54 +00:00
David Huynh
254853b51d Added reverse and sort functions.
Support a limit on how many rows to load into a new project.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@134 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 23:22:02 +00:00
David Huynh
e449617960 Fixed minor bug introduced in last change: recon candidates didn't get rendered at all.
Added "refresh" link for list facets.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@133 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 20:20:14 +00:00
David Huynh
2eb3a67125 Styled data table view summary a bit to highlight current number of rows.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@132 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 19:18:25 +00:00
David Huynh
4bdb2320b7 Styled help tab of expression preview dialog. Added variables section.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@131 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 19:06:58 +00:00
David Huynh
dce42400d4 Fixed bug introduced while trying to delay constructing the candidates arrays in Recon objects.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@130 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 17:47:34 +00:00
David Huynh
ec1604e815 Added support for starring rows.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@129 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 07:45:12 +00:00
David Huynh
92806cea7e Forgot one instance of "gridlock" after project rename.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@128 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 01:49:41 +00:00
David Huynh
8992531d02 Documented functions and controls in expression language.
Better error checking in operator calls.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@127 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 01:40:06 +00:00
David Huynh
607fca04cb Added a few more math functions.
Fixed expression preview dialog to use tabs.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@126 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 00:33:39 +00:00
David Huynh
94fbd97bc4 Added a few more expression functions.
Bind row index when filtering rows, so we can create facets based on row indices.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@125 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 23:51:44 +00:00
David Huynh
0f505c72c5 Delay constructing the candidates array in recon objects to save memory.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@124 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 23:27:16 +00:00
David Huynh
c45e0edc10 Lower recon batch size back to 10.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@123 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 23:06:34 +00:00