David Huynh
b730dfd8f9
Added commands for searching for specific topics to match cells with.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@113 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-20 00:47:08 +00:00
David Huynh
ea2c904704
Use the schema index to suggest properties in the schema alignment dialog.
...
Fixed minor bug in triple loader transposer that wrote a bad triple for each literal cell value.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@112 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 22:56:29 +00:00
David Huynh
846e540ff6
Keep track of type names of reconciled columns so we can display them later in the schema alignment dialog.
...
Automatically create properties linking to all columns when starting with an empty protograph.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@110 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 18:32:48 +00:00
David Huynh
6c7557eeff
Minor bug fixes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@108 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 17:47:04 +00:00
David Huynh
bc412b99ea
Fixed bug in triple loader transposer: properties didn't get asserted before.
...
Made triple loader transposer index its output variables.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@106 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 07:05:09 +00:00
David Huynh
5264c829ae
A bit more careful error handling during recon.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@105 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 01:17:35 +00:00
David Huynh
28a86dfe0f
Automatically guess types to reconcile a column, using Stefano's trick in his "cupid" acre app.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@104 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 00:28:34 +00:00
David Huynh
4b2e48614b
Actual work in operations must be delayed until their changes are applied.
...
Column addition change must track the new cell index that it allocates when it is first applied.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@103 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-18 23:27:40 +00:00
David Huynh
8831703a2c
Implemented "apply operations" feature.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@102 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-18 05:00:56 +00:00
David Huynh
604dd53ebd
Engine configs were not deserialized properly when abstract operations were retrieved.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@101 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 01:40:41 +00:00
David Huynh
8c41af9c12
Allow operations to be extracted in abstract forms.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@99 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 01:30:09 +00:00
David Huynh
32157ce76b
Changed operations to record column names instead of cell indices.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@98 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 00:26:38 +00:00
David Huynh
e6a98f23bd
Implemented triple loader preview.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@97 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 23:32:12 +00:00
David Huynh
aa530395d2
Use tabs in the schema alignment dialog to get more space.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@96 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 20:45:04 +00:00
David Huynh
5de0c36f86
Protograph preview now works.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@95 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 20:15:19 +00:00
David Huynh
8189ba74fd
Schema alignment dialog now saves protograph and re-renders it properly.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@92 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 22:11:49 +00:00
David Huynh
425140261f
We're starting to be able to save protographs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@91 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 20:29:56 +00:00
David Huynh
634d666949
More work on the schema alignment node dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@90 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 02:23:26 +00:00
David Huynh
f5942773ec
Still more work on the protograph, toward being able to build and save a protograph, but it's not working yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@89 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-11 06:44:48 +00:00
David Huynh
d227db0cc6
Eliminate hash maps from recon objects--they are expensive.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@88 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 19:25:21 +00:00
David Huynh
5cd147ea3c
Compute record indices and render them instead of row indices.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@87 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 18:54:53 +00:00
David Huynh
242e23c085
The schema alignment dialog is starting to work. The protograph gets rendered and is interactive. No saving yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@83 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 01:15:25 +00:00
David Huynh
97e2e0eddc
Implemented "judge one cell" command for making recon judgment per cell.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@80 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 20:04:43 +00:00
David Huynh
f8a1daba62
Handle formula cells in Excel files.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@77 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 01:13:11 +00:00
David Huynh
b7cf18b86a
Save a change right after it gets applied rather than when it gets created. This is because when a change gets applied, it might grab onto the old data in order to be able to revert later, and we need to save that old data together with the change.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@74 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 23:57:37 +00:00
David Huynh
cd376c7532
Added support for Excel 2007 XML file format.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@73 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 23:44:33 +00:00
Stefano Mazzocchi
1f5b27653e
POI deprecated the use of short, good thing
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@69 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 19:54:09 +00:00
Stefano Mazzocchi
1343162a75
major rewrite of the foundation:
...
- de-mavenization (uses the same code that Acre uses to drive jetty directly)
- removed all dependencies on external javascript code (jquery and suggest) by making a local copy (this makes gridworks totally self-serving, meaning that you can use it even if you don't have any internet connectivity)
- fixed an NPE when the servlet is shut down before any project is loaded
- found a way to spawn a browser directly from the java code (untested on Windows)
- added two ant tasks to generate Windows and Mac OS X stand-alone binaries (unused just yet)
To run, just type "./gridworks run" at the command line
git-svn-id: http://google-refine.googlecode.com/svn/trunk@65 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 23:15:50 +00:00
David Huynh
8f186a5f10
Added a help panel to the expression preview dialog. It gets populated by function and control names for now; more info will come later.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@64 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 09:02:22 +00:00
David Huynh
d3f97fea93
While importing data, use null for cells with empty text.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@63 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 07:16:39 +00:00
Stefano Mazzocchi
a61f35079a
make eclipse happier by removing @Override annotations when really it's an interface method implementation
...
(no functional changes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@62 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 06:47:52 +00:00
David Huynh
a025b272bd
String.isEmpty() is no longer there (?!).
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@61 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 06:16:46 +00:00
David Huynh
2ceaa53390
Split css files.
...
Allow expressions to be evaluated on null cells too.
Various minor UI polishing touches and minor bug fixes.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@58 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 03:36:40 +00:00
David Huynh
51de415809
Added recon config to column model so we can use that to do automatic schema alignment.
...
Fixed bug in recon operation that also tried to reconcile blank cells.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@57 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 01:07:45 +00:00
David Huynh
2eccab4c71
We need to recompute the row context dependencies after cell and column changes, but not after recon changes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@56 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-06 06:40:17 +00:00
David Huynh
bf79b24bf3
A bit more modeling for the proto-graph.
...
Two spinners to prepare for more async UI indications.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@55 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-06 02:54:01 +00:00
David Huynh
8701d504e6
More error checking.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@54 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-06 02:41:31 +00:00
David Huynh
149330fe05
Fixed bug in context row dependency calculations.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@53 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 23:32:29 +00:00
David Huynh
98a16ca500
Render contextual rows when filtered.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@51 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 23:05:00 +00:00
David Huynh
755b01c2c4
Keep number of bins below 100, or Google Charts won't work.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@50 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 20:12:56 +00:00
David Huynh
3e30ab2a33
Added length function and support for creating custom numeric (range slider) facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@48 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 19:43:43 +00:00
David Huynh
16dda46a61
Refactored importers, adding support for Excel files.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@47 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 19:19:38 +00:00
David Huynh
3ea230b11c
Minor bug fixes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@46 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 08:56:44 +00:00
David Huynh
8444136be1
Added control constructs to expression language.
...
Fixed bugs in multi-value cell split/join operations.
Condensed movies-condensed.tsv test data further.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@45 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 08:39:10 +00:00
David Huynh
937d91ab1d
Implement split and join multi-value cells features.
...
Reorganized column header popup menu.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@44 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 06:29:27 +00:00
David Huynh
62c8c72dc4
Added export row feature.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@43 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 01:58:06 +00:00
David Huynh
93f0dfd63a
Started to render column groups. Added a sample data set with CVTs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@42 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 01:27:14 +00:00
David Huynh
f8e15798e2
Implemented column addition and removal features.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@41 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 23:38:40 +00:00
David Huynh
bacb71ab6d
Refactoring to support abstract operations.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@40 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 20:51:44 +00:00
David Huynh
9a63bdd821
Unified the way expression bindings are created, so everywhere we'll have the same variable bindings.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@38 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 06:30:56 +00:00
David Huynh
4ee1684fa3
Break the commands package into several for clearer organization.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@37 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 06:12:22 +00:00
David Huynh
720f147c09
Moved concrete change classes out to their own package. Added a few change classes for columns, but they are not hooked up yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@36 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 06:00:26 +00:00
David Huynh
863493950f
Started to model the protograph for schema mapping.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@35 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 00:57:09 +00:00
David Huynh
44ea417d0d
Range slider facets now display sub-distribution when filtered by other facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@34 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 00:29:40 +00:00
David Huynh
21f2403146
Encapsulated a bunch of fields.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@33 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 21:57:38 +00:00
David Huynh
97fd9422f6
Started to add the key columns and column groups into the data model.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@32 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 20:55:48 +00:00
David Huynh
fb34d6f507
Implemented preview expression dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@31 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 09:27:53 +00:00
David Huynh
4ec35cf5bb
A few find/replace from gridlock to gridworks.
...
Added list of existing projects on front page.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@28 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 06:23:40 +00:00
David Huynh
047f59e461
Renamed java packages.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@27 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 02:29:47 +00:00
David Huynh
ed5eae83af
Added histograms to range slider facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@25 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 00:20:42 +00:00
David Huynh
00696a96fc
Added commands to approve, approve new, and discard recon candidates.
...
Support sections in menus.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@24 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-02 20:50:35 +00:00
David Huynh
ac46653aa3
Added text search facet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@23 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-02 19:16:09 +00:00
David Huynh
f870333715
Added a bunch of custom recon facets. Implemented range slider facet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@20 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-02 01:36:02 +00:00
David Huynh
58f2dd8f60
Recon process seems to be working. We can now filter rows by recon features using custom expressions.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@19 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 20:57:52 +00:00
David Huynh
fe8810aa07
Refactored to use JSONWriter to be more efficient at generating output.
...
Added created and modified time fields to project metadata.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@18 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 19:16:09 +00:00
David Huynh
17cbe6b62d
Added dialog system.
...
Started to implement recon dialog.
Added long-running recon process, which doesn't actually do reconciliation just yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@17 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 08:31:50 +00:00
David Huynh
06b5373151
Client-side UI widget for long running processes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@16 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 03:22:35 +00:00
David Huynh
8b22eb594f
Refactoring and adding more functions.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@14 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-31 23:15:50 +00:00
David Huynh
e24d40c3da
Faceted browsing is starting to work.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@13 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-30 01:05:30 +00:00
David Huynh
dce2ec71aa
Started client-side scaffolding for faceted browsing.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@12 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-29 00:46:15 +00:00
David Huynh
e0365f45c8
More server-side scaffolding for faceted browsing.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@11 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-28 01:43:09 +00:00
David Huynh
23b9e313b8
Implemented expression parser.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@10 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-27 22:27:22 +00:00
David Huynh
6889d0e58a
Server-side scaffolding for faceted browsing.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@9 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-27 07:52:05 +00:00
David Huynh
73ecc8f5d2
Skip any row that has all empty cells.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@8 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-27 02:20:13 +00:00
David Huynh
e5bba41062
We now have a few basic text transform features. The history widget is also starting to work. It depends on a process queue in the back-end.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@7 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-27 01:48:42 +00:00
David Huynh
9f436ad717
More scaffolding.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@4 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-26 05:17:14 +00:00
David Huynh
e1fa9279eb
More scaffolding. We can now upload a tsv/csv file and create and persist a project.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@3 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-25 22:51:25 +00:00
David Huynh
22040a8348
Initial import.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-24 21:09:50 +00:00