Commit Graph

57 Commits

Author SHA1 Message Date
Antonin Delpeuch
11c2c7cf57 Update httpcore library 2018-04-25 17:43:14 +02:00
Antonin Delpeuch
c638189803 Update httpclient, httpmime and fluent-hc
Closes #1583.
2018-04-25 15:37:44 +02:00
Jacky
de42f4d85a migrate from JRDF to JENA library 2018-04-08 22:38:23 -04:00
Jacky
c4b0ff6bea data package metadata (#1398)
* fix the appbundle issue #1209

* fix #1162

allow the JRE 9

* fix the package declarations

* remove the _ from the method name

* use the explicit scoping

* remote extra ;

* fix issued from codacy

* fix issued from codacy

* add preferences link to the index page

* handle the empty user metadata

* fix 'last modified' sorting issue #1307

* prevent overflow of the table. issue #1306

* add isoDateParser to sort the date

* prevent overflow of the project index

* remove sorter arrow for action columns

* disable editing the internal metadata

* adjust the width of the table

* change MetaData to Metadata

* change the filed name from rowNumber to rowCount

* put back the incidently deleted gitignore

* add double quote to prevent word splitting

* UI improvement on metadata view and project list view

* remove the date field in metadata

* message notification of the free RAM. Issue #1295

* UI tuning for metadata view

* shorten the ISO date to locale date format

* Added translation using Weblate (Portuguese (Brazil))

* remove the rename link

* Ignore empty language files introduced by Weblate

* Add UI for Invert text filter

* Backend support for Inverting Text search facets

* Fix reset on text search facet

* More succinct return statements

* add tests for SetProjectMetadataCommand

* Tidying up for Codacy

* Added Tests for TextSearchFilter

* Corrections for Codacy

* More code tidy up

* let the browser auto fit the table cell when resizing/zooming

* fix import multiple excel with mulitple sheets issue #1328

* check if the project has the userMetadata

* fix the unit test
support multi files with multi tables for open office

* prevent the same key for user metadata

* replace _ with variable for exception

* fix the no-undef issue

* to adjust the width of transform dialog. issue #1332

* fix the row count refresh issue

* extract method

* move the log message

* cosmatic changes for codacy

* fix typo

* bump to version 2.8

* .gitignore is now working

* preview stage won't have the metadata populated, so protect NPE

* Update README.md

No more direct link to the last version tag, which will avoid having to think of updating the readme

* refacotring the ProjectMetadata class

* introduce the IMetadata interface

* create submodule of dataschema

* add back

* setup lib for dataschema; upgrade the apache lang to lang3

* replace escape* functions from apache lang3

* replace the ProjectMetadata with IMetadata interface

* add missing jars

* set the IMetadata a field of Project

* remove PreferenceStore out of Project model

* fix test SetProjectMetadataCommandTests by casting

* introdcue the AbstractMetadata

* introdcue the AbstractMetadata

* reorganize the metadata package

* allow have mulitiple metadata for a project

* support for mulitple metadata format

* remove jdk7 since 'table schema' java implmentation only support jdk8+

* set execute permission for script

* fix the Unit Test after Metadata refactoring

* restore the apache lang2.5 since jetty 6.1.22 depend on it

* add commons lang 2.5 jar

* git submodule add  https://github.com/frictionlessdata/datapackage-java

* remove the metadata parameter from the ProjectManager.registerProject method

* remove hashmap _projectsMetadata field from the ProjectManager and FileProjectManager

* init the Project.metadataMap

* fix Unit Test

* restore the ProjectMetaData map to ProjectManager

* put the ProjectMetaDta in place for ProjectManager and Project object

* check null of singleton instead of create a constructor just for test

* load the data package metadata

* importing data package

* importing data package

* encapsulate the Package class into DataPackageMetadata

* user _ to indicate the class fields

* introduce base URL in order to download the data files

* import data package UI and draft backend

* import data package UI

* fix typo

* download the data set pointed from metadata resource

* save and load the data package metadata

* avoid magic string

* package cleanup

* set the java_version to 1.8

* set the min jdk to 1.8

* add the 3rd party src in the build.xml

* skip the file selection page if only 1 DATA file

* add files structure for json editor

* seperate out the metadata file from the retrival file list

* rename the OKF_METADATA to DATAPACKAGE_METADATA

* clean up

* implement GetMetadateCommand class

* display the metadata in json format

* git submodule update --remote --merge

* adjust the setting after pulling from datapackage origin

* fix the failed UT DateExtensionTests.testFetchCounts due to new json jar json-20160810.jar will complain: JSONObject["float"] not a string.

* clean up the weird loop array syntax get complained

* remove the unused constant

* export in data package format

* interface cleanup

* fix UT

* edit the metadata

* add UT for SetMetadataCommand

* fix UT for SetMetadataCommand

* display the data package metadata link on the project index page

* update submodule

* log the exceptions

* Ajv does not work properly, use the back end validation instead

* enable the validation for jsoneditor

* first draft of the data validation

* create a map to hold the constraint and its handler

* rename

* support for minLength and maxLength from spec

* add validate command

* test the opeation instead of validate command

* rename the UT

* format the error message and push to the report

* fix row number

* add resource bundle for validator

* inject the code of the constrains

* make the StrSubstitutor works

* extract the type and format information

* add the customizedFormat to interface to allow format properly

* get rid of magic string

* take care of missing parts of the data package

* implement RequiredConstraint

* patch for number type

* add max/min constraints

* get the constrains directly from field

* implement the PatternConstraint

* suppress warning

* fix the broken UT when expecting 2 digits fraction

* handle the cast and type properly

* fix the missing resource files for data package when run from command line

* use the copy instead of copydir

* add script for appveyor

* update script for appveyor

* do recursive clone

* correct the git url

* fix clone path

* clone folder option does not work

* will put another PR for this. delete for now

* revert the interface method name

* lazy loading the project data

* disable the validate menu for now

* add UT

* assert UTs

* add UT

* fix #1386

* remove import

* test the thread

* Revert "test the thread"

This reverts commit 779214160055afe3ccdcc18c57b0c7c72e87c824.

* fix the URLCachingTest UT

* define the template data package

* tidy up the metadata interface

* check the http response code

* fix the package

* display user friendly message when URL path is not reachable

* populate the data package schema

* Delete hs_err_pid15194.log

* populate data package info

* add username  preference and it will be pulled as the creator of the metadata

* undo the project.updateColumnChange() and start to introduce the fields into the existing core model

* tightly integrate the data package metadata

* tightly integrate the data package metadata for project level

* remove the submodule

* move the edit botton

* clean up build

* load the new property

* load the project metadata

* fix issues from codacy

* remove unused fields and annotation

* check the http response code firstly

* import zipped data package

* allow without keywords

* process the zip data package from url

* merge the tags

* check store firstly

* remove the table schema src

* move the json schema files to schema dir

* add comment

* add comment

* remove git moduels

* add incidently deleted file

* fix typo

* remove SetMetadataCommand

* revert change

* merge from master
2018-02-02 13:24:19 +00:00
Antonin Delpeuch
c9cc4fb262 Update Jackson to 2.9.1
Closes #1262
2017-10-09 17:38:09 +01:00
Antonin Delpeuch
88aa6f113d Switch to newer version of swc-parser-lazy 2017-08-13 20:14:29 +01:00
Antonin Delpeuch
e6406f56ef Initial version of the wikitext importer 2017-08-13 11:26:59 +01:00
Tom Morris
1222acdc57 Update POI to 3.13 - replaces #1029 2015-09-25 20:06:21 -04:00
Tom Morris
de66afa512 Revert " Use new algorithm for levenshtein clustering" 2015-09-25 16:44:25 -04:00
Thad Guidry
94e219042e Merge pull request #1007 from lispc/master
Use new algorithm for levenshtein clustering
2015-09-21 20:23:45 -05:00
jackyq2015
6e315853a5 upgrade POI from 3.8 to 3.12 2015-07-11 11:08:29 -04:00
lispc
87c342a484 change EditDistanceJoiner.jar to newest version 2015-06-02 01:09:38 +08:00
lispc
4ce327dba5 change EditDistanceJoiner.jar to multicore version 2015-06-02 00:41:01 +08:00
lispc
43e441a4d0 Use new algorithm for levenshtein clustering 2015-06-01 20:35:21 +08:00
Tom Morris
cbc3ab78c7 Revert "Update Guava to 18.0"
This reverts commit bfc39d9e8c.
2015-05-14 15:24:59 -04:00
Tom Morris
bfc39d9e8c Update Guava to 18.0 2014-12-21 12:32:50 -05:00
Tom Morris
c68c1bb2b1 Upgrade to Clojure 1.5.1 & switch to clojure-slim JAR - #792 2013-08-26 19:40:37 -04:00
Tom Morris
6440194658 Update broken Butterfly build 2013-08-18 12:37:26 -04:00
Tom Morris
25a3de410f Update to patched version of Butterfly - fixes #652 2013-08-18 11:24:33 -04:00
Tom Morris
cf6e855bf5 Update to latest patched OpenCSV 2013-08-07 16:18:48 -04:00
Tom Morris
1353f7f7aa Update Apache HTTP client JARs to 4.2.5 2013-07-10 15:16:21 -04:00
Tom Morris
163546de94 Remove update JARs 2013-07-09 14:17:40 -04:00
Tom Morris
56389588e3 Update to Jackson 1.9.12 2013-03-23 08:15:08 -04:00
Tom Morris
975d18942f Update to Clojure 1.4. Closes issue #634 2013-02-12 00:15:35 -05:00
Tom Morris
008df21b33 Update OpenCSV to our fork with multi-character separator support
Built from the openrefine-2.6 branch in the OpenRefine/opencsv repo
2013-01-27 15:21:53 -05:00
Tom Morris
6e50347c77 Delete unused bad JSON jar 2012-11-18 11:06:35 -05:00
Tom Morris
be6bdfe50a Restore old JSON library jar
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2535 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-21 17:29:47 +00:00
Tom Morris
60c3a31242 Update Jackson and JSON libs
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2532 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-18 21:46:49 +00:00
Tom Morris
4bb6c43982 task 604: add Guava to main project so that we're not dependent on an extension
http://code.google.com/p/google-refine/issues/detail?id=604

git-svn-id: http://google-refine.googlecode.com/svn/trunk@2531 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-15 13:33:17 +00:00
Stefano Mazzocchi
5dffd249de updating signpost to the latest release
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2514 7d457c2a-affb-35e4-300a-418c747d4874
2012-07-13 06:50:53 +00:00
Tom Morris
166b176ba2 Update to Apache POI 3.8
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2486 7d457c2a-affb-35e4-300a-418c747d4874
2012-03-29 04:41:42 +00:00
Tom Morris
8ff6c5617f Update Jackson parser to 1.9.5
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2448 7d457c2a-affb-35e4-300a-418c747d4874
2012-03-01 18:11:28 +00:00
Stefano Mazzocchi
8184e16bb9 updating http client and http core to the latest released versions
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2351 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-01 21:46:56 +00:00
Tom Morris
ca17e1ef0a New importer for Open Document Format (ODF) spreadsheet files (.ods)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2323 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 20:27:40 +00:00
Tom Morris
f7475e2f3f Update JSON parser lib to Jackson 1.8.6
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2288 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 16:59:39 +00:00
Tom Morris
830a2c376f Update to Apache Commons Codec 1.5
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2095 7d457c2a-affb-35e4-300a-418c747d4874
2011-06-08 00:58:47 +00:00
Tom Morris
149cf1dfcb Update to Apache POI 3.7
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2062 7d457c2a-affb-35e4-300a-418c747d4874
2011-05-25 06:10:32 +00:00
Tom Morris
518ca1e654 Update to Apache POI 3.7
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2061 7d457c2a-affb-35e4-300a-418c747d4874
2011-05-25 06:07:44 +00:00
Iain Sproat
f55f11cd0d Adding classes to now make it possible to parse Html in GREL. Uses small subset of methods from the JSoup library, licensed under the MIT license.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1948 7d457c2a-affb-35e4-300a-418c747d4874
2010-12-06 23:15:24 +00:00
David Huynh
8d1f2d44b9 Patched the json lib to allow up to 100 levels of nesting.
Fixed ImportProjectCommand to redirect from the error page back to /index rather than /index.html.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1270 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 00:21:54 +00:00
Stefano Mazzocchi
4af3a4cbf5 allow gridworks to load extensions also from the same place where data is stored (in the 'extension' subfolder')
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1258 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-02 01:17:54 +00:00
Stefano Mazzocchi
3bb0d3918c adding the latest butterfly with the ability to avoid loading specific extensions (selected with strings or regexps on their names) via a property
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1247 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 23:08:47 +00:00
Stefano Mazzocchi
37099b4c9f new butterfly version: mount extensions in their own subdirectory in the URL space
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1233 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 01:14:47 +00:00
Stefano Mazzocchi
deaaf78748 adding latest version of butterfly (that supports module autowiring)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1179 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-20 22:47:14 +00:00
David Huynh
f069780bfa Added support for bundling .js files to shave off some loading time.
For GetRowsCommand, tried to use jsonp but that didn't seem to improve performance much.
Gzip http responses of various text-based mime types.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1122 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-31 06:51:11 +00:00
Stefano Mazzocchi
5193630b9a updating to the latest butterfly (and remove the version number that is a pain to maintain for now)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1034 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-25 07:03:16 +00:00
David Huynh
52f1e2ba53 Updated butterfly jar.
Extracted several dialogs' html fragments from their .js files out to .html files.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1016 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-21 21:59:53 +00:00
Iain Sproat
7ced0cb31e New feature for importing text files (CSV and TSV). By selecting the checkbox in index.html it allows the effects of quotation marks around data values to be ignored.
Unit test added for this.

This has required a further branch to opencsv - patch sent to opencsv project and can be tracked at  https://sourceforge.net/tracker/?func=detail&aid=3018599&group_id=148905&atid=773543

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1010 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-20 14:47:45 +00:00
David Huynh
778873957f Updated butterfly to r25, which now initializes modules in order based on their dependencies.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1006 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-20 01:06:47 +00:00
Stefano Mazzocchi
4c28f616f7 updating butterfly to the latest trunk (now allows modules to change their name with properties)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@994 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-19 02:31:36 +00:00