RandomSec/docs/docs/manual/starting.md

---
id: starting
title: Starting a project
sidebar_label: Starting a project
---

## Overview

OpenRefine doesn’t allow you to create a dataset from nothing - you can only start a project by importing in some existing data.

No matter where your data comes from, OpenRefine doesn’t modify your original data source. It copies all the information from your input, creates its own project file, and stores it in your [workspace directory](installing.md#set-where-data-is-stored).  

The data and all of your edits are automatically saved inside the project file. When you’re finished modifying the data, you can export it back out into the file format of your choice. 

You can also receive and open other people’s projects, or send them yours, by exporting a project archive and importing it. 

## Create project by importing data

When you start OpenRefine, you’ll be taken to the `Create Project` screen. You’ll see on the left side of the screen that your options are to: 

*   import data from a file on your computer
*   import data from a link to the web
*   import data by pasting in text from your clipboard
*   import data from a database (using SQL), and
*   import Sheets from Google Drive. 

From these sources, you can load any of the following file formats:

*   comma-separated values (CSV) or text-separated values (TSV)
*   Text files
*   Fixed-width columns
*   JSON
*   XML
*   ODF spreadsheet (ODS)
*   Excel (XLS or XLSX)
*   PC-Axis
*   MARC
*   RDF data (JSON-LD, N3, N-Triples, Turtle, RDF/XML)
*   Wikitext

More formats can be imported by [adding extensions that provide that functionality](https://openrefine.org/download.html). 

If you supply two or more files for one project, the files’ rows will be loaded in order that you specify, and OpenRefine will create a column at the beginning of the dataset with the source URL or file name in it to help you identify where each row came from. If the files have matching columns, the data will load in each column; if not, the successive files will append all of their new columns to the end of the dataset:

|File|Fruit|Quantity|Berry|Berry source|
|---|---|---|---|---|
|fruits.csv|Orange|4|
|fruits.csv|Apple|6|
|berries.csv||9|Mulberry|Greece|
|berries.csv||2|Blueberry|Canada|


For whichever method you choose, when you click `Next >>` you will be given a preview and a chance to configure the way OpenRefine interprets the file.  

### Methods to import data

#### Get data from this computer

Click on `Browse…` and select a file on your hard drive. All files will be shown, not just compatible ones. 

If you import an archive file (something with the extension `.zip`, `.tar.gz`, `.tgz`, `.tar.bz2`, `.gz`, or `.bz2`), OpenRefine detects the most common file extension in it and loads all files with that extension into a single project.

#### Web Addresses (URLs)

Type or paste the URL to the data file into the field provided. You can add as many fields as you want. OpenRefine will download the file and preview it for you. 

If you supply two or more file URLs, OpenRefine will identify each one and ask you to choose which (or all) to load. 

Do not use this form to load a Google Sheet by its link; use the Google Data form instead. 

#### Clipboard

You can copy and paste in data from anywhere. OpenRefine will recognize comma-separated, tab-separated, or table-formatted information copied from sources such as word-processing documents, spreadsheets, and tables in PDFs. You can also just paste in a list of items that you want to turn into multi-column rows. OpenRefine recognizes each new text line as a row. 

This can be useful if you want to pre-select a specific number of rows from your source data, or paste together rows from different places, rather than delete unwanted rows later in the project interace. 

This can also be useful if you would like to paste in a list of URLs, which you can use later to fetch the data online and build columns with. 

#### Database (SQL)

You may want to pull the latest dataset directly from an online database. This could include an online catalogue, a Wordpress or similar content management system, or a [digital repository or collection management system](https://bits.ashleyblewer.com/blog/2017/08/09/collection-management-system-collection/).

OpenRefine can connect to PostgreSQL, MySQL, MariaDB, and SQLite database systems. It will automatically populate the “Port” field based on which of these you choose, but you can manually edit this if needed.

If you have a downloaded database (`.db`) file, you can supply the path to the file on your computer directly in the “Database” field at the bottom of the form. You can leave the rest of the fields blank.

To import data directly from a database online, you will need to do two things:

*   Add OpenRefine (running from your computer) to an account authorized to access your database
*   Set up OpenRefine to access that database using that account 

![A screenshot of connecting to a database](img/databaseconnect.jpg "A screenshot of connecting to a database.")

Log in to your hosting provider. Get the database type (such as MySQL), database name, and the URL (either an IP address, such as `127.0.0.1`, or the domain that uses the database). Then look at the accounts authorized for access. You may wish to create a new account just for OpenRefine, or add OpenRefine to an existing account. 

Each host will have a slightly different method, but generally speaking: look for “accounts with access” to the database you wish to authorize, and within the settings for that account, look for “allowable hosts” or “access hosts.” 

In that list, add the IP address of your own computer, because that is where the OpenRefine access request will be coming from. You can find this easily, by clicking “Test” within OpenRefine once the rest of the information is filled out: OpenRefine will give you an error that looks like 

``` error:Access denied for user 'yourusername'@'123-45-67-89.yourISP.com' ```

Take your IP address from this error message and put that, with periods instead of hyphens, into the “allowable hosts” field on the account you’re trying to use. Add a wildcard to the end of your IP address (“123.45.67.89%”). Save that setting, and then test the connection again with OpenRefine.

You can either connect just once and gather data, or save the connection to use it again later. If you press “Connect” without saving, OpenRefine will forget all the information you just entered. 

If you’d like to save the connection, name your connection in a way you will recognize later. Click “Save” and it will appear in the “Saved Connections” list on the left. Now, click on the “...” ellipsis to the right of the connection you’ve saved, and click “Connect.” 

If your connection is successful, you will see a Query Editor. From here you can write an [SQL query](https://www.w3schools.com/sql/) to pull the specific data you need. 

If you need help, you may be able to find instructions from your hosting provider. Here are the guides from:


*   [Dreamhost](https://help.dreamhost.com/hc/en-us/articles/214883058-How-do-I-connect-to-my-database-using-a-third-party-program-)
*   [GoDaddy](https://ca.godaddy.com/help/connect-remotely-to-a-mysql-database-in-my-linux-hosting-account-16103)


#### Google Data

You have two ways to load in data from Google Sheets:


*   A link to an accessible Google Sheet (that is, one with link-sharing turned on)
*   Selecting a Google Sheet in your Google Drive


##### Google Sheet by URL

You can import data from any Google Sheet that has link-sharing turned on. Paste in a URL that looks something like

```https://docs.google.com/spreadsheets/……….../edit?usp=sharing```

This will only work with Sheets, not with any other Google Drive file that might have an available link. 


##### Google Sheet from Drive

You can authorize OpenRefine to access your Google Drive data and import data from any Google Sheet it finds there. This will include Sheets that belong to you and Sheets that are shared with you, as well as Sheets that are in your trash. 

OpenRefine will not show spreadsheets that are in your email inbox or stored in any other Google property - only in Drive. It also won’t show all compatible file formats, only Sheets files.

OpenRefine will generate a list of all Sheets it finds, with the most recently modified Sheets at the top. If a file you’ve just added isn’t showing in this list, you can close and restart OpenRefine, or simply navigate to an existing project, open it, then head back to the “Create Project” window and check again. 

When you click “Preview” the Sheet will open in a new browser tab. When you click the Sheet title, OpenRefine will begin to process the data.


### Previewing the project

Once OpenRefine is ready to import the data, you will see a screen with “Configure Parsing Options” at the top. You’ll see a preview of the first 100 rows and all identified columns. 

At the bottom of the screen you will find options for telling OpenRefine how to process what it has found. You can tell it which row(s) to parse as column headers, as well as to ignore any number of rows at the top. You can also select a specific range of rows to work with, by discarding some rows at the top (excluding the header) and limiting the total number of rows it loads.

OpenRefine tries to guess how to parse your data based on the file extension. For example, .xml files are going to be parsed as though they are formatted in XML. An unknown file extension (or your clipboard copy-paste) is assumed to be either tab-separated or comma-separated. OpenRefine looks for a tab character; if one is found, it assumes you have imported tab-separated data.

If Google isn’t certain what format you imported, it will provide a list of possibilities under “Parse data as” and some settings. You can specify a custom separator now, or split columns later on in the project interface. 

If you imported a spreadsheet with multiple worksheets, they will be listed along with the number of rows they contain. You can only select data from one worksheet. 

Note that OpenRefine does not preserve any formatting, such as cell or text colour, that my have been in the original data file. 

You should create a project name at this stage. You can also supply tags to keep your projects organized. When you’re happy with the preview, click “Create Project.”


## Import a project

Because OpenRefine only runs locally on your computer, you can’t have a project accessible to more than one person at the same time. 

The best way to collaborate with another person is to export and import projects that save all your changes, so that you can pick up where someone else left off. You can also [export projects](export.md) and import them to new computers of your own, such as for working on the same project from the office and from home. 

An exported project will include all of the [history](interface/projectscreen.md#history), so you can see (and undo) all the changes from the previous user. It is essentially a point-in-time snapshot of their work. OpenRefine only exports projects as `.tar.gz` files at this time. 


### Instructions

Once someone has sent you a project archive file from their computer, you can save it anywhere, including your Downloads folder. 

In the left-hand menu of the home screen, click “Import Project.” You will see the following screen:

![A screenshot of the Import screen](img/import.jpg "A screenshot of the Import screen.")


Click “Browse…” and navigate to wherever you saved the file you were sent (for example, your Downloads folder). 

You can rename the project if you’d like - we recommend adding your name, a date, or a version number, if you’re planning to continue collaborating with another person (or working from multiple computers).

Then, click “Import Project.”  Your project should appear with a step count beside “undo/redo” if steps were saved by the exporter. 

OpenRefine will store the project in its own workspace directory, so you can now delete the original file that was sent to you. 


## Project management

You can access all of your created projects by clicking on “Open Project.” Your project list can be organized by modification date, title, row count, and other metadata you can supply (such as subject, descripton, tags, or creator). To edit the fields you see here, click “About” to the left of each project. There you can Edit a number of available fields. You can also see the project ID that corresponds to the name of the folder in your work directory.  


### Naming projects 

You may have multiple projects from the same dataset, or multiple versions from sharing a project with another person. OpenRefine automatically generates a project name from the imported file, or “clipboard” when you use Clipboard importing. Project names don’t have to be unique, so OpenRefine will create many projects with the same name unless you intervene. 

You can name a project when you create it or import it, and you can rename a project by opening it and clicking on the project name at the top of the screen. 


### Autosaving 

OpenRefine saves all of your actions (everything you can see in the Undo/Redo panel). That  includes flagging and starring rows.

It doesn’t, however, save your facets, filters, or any kind of “view” you may have in place while you work. This includes the number of rows showing, whether you are showing your data as rows or records, and any sorting or column collapsing you may have done. A good rule of thumb is: if it’s not showing in Undo/Redo, you will lose it when you leave the project workspace. 

You can only save and share facets and filters, not any other type of view. To save current facets and filters, click “Permalink.” The project will reload with a different URL, which you can then copy and save elsewhere. This permalink will save both the facets and filters you’ve set, and the settings for each one (such as sorting by count rather than by name). 


### Deleting projects

You can delete projects, which will erase the project files from the work directory on your computer. This is immediate and cannot be undone.

 
Go to “Open Project” and find the project you want to delete. Click on the X to the left of the project name. There will be a confirmation dialog. 


### Project files

You can find all of your raw project files in your work directory. They will be named according to the unique Project ID that OpenRefine has assigned them, which you can find on the “Open Project” screen, under the “About” button for each project. 

You can point OpenRefine to use another work directory using [Preferences](running.md#preferences).
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
+								---
 								id: starting
 								title: Starting a project
 								sidebar_label: Starting a project
 								---
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								## Overview
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								OpenRefine doesn’t allow you to create a dataset from nothing - you can only start a project by importing in some existing data.
 								No matter where your data comes from, OpenRefine doesn’t modify your original data source. It copies all the information from your input, creates its own project file, and stores it in your [workspace directory](installing.md#set-where-data-is-stored).
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								The data and all of your edits are automatically saved inside the project file. When you’re finished modifying the data, you can export it back out into the file format of your choice.
 								You can also receive and open other people’s projects, or send them yours, by exporting a project archive and importing it.
 								## Create project by importing data
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								When you start OpenRefine, you’ll be taken to the `Create Project` screen. You’ll see on the left side of the screen that your options are to:
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								*   import data from a file on your computer
 								*   import data from a link to the web
 								*   import data by pasting in text from your clipboard
 								*   import data from a database (using SQL), and
 								*   import Sheets from Google Drive.
 								From these sources, you can load any of the following file formats:
 								*   comma-separated values (CSV) or text-separated values (TSV)
 								*   Text files
 								*   Fixed-width columns
 								*   JSON
 								*   XML
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								*   ODF spreadsheet (ODS)
 								*   Excel (XLS or XLSX)
 								*   PC-Axis
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
+								*   MARC
 								*   RDF data (JSON-LD, N3, N-Triples, Turtle, RDF/XML)
 								*   Wikitext
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								More formats can be imported by [adding extensions that provide that functionality](https://openrefine.org/download.html).
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								If you supply two or more files for one project, the files’ rows will be loaded in order that you specify, and OpenRefine will create a column at the beginning of the dataset with the source URL or file name in it to help you identify where each row came from. If the files have matching columns, the data will load in each column; if not, the successive files will append all of their new columns to the end of the dataset:
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								|File|Fruit|Quantity|Berry|Berry source|
 								|---|---|---|---|---|
 								|fruits.csv|Orange|4|
 								|fruits.csv|Apple|6|
 								|berries.csv||9|Mulberry|Greece|
 								|berries.csv||2|Blueberry|Canada|
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								For whichever method you choose, when you click `Next >>` you will be given a preview and a chance to configure the way OpenRefine interprets the file.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								### Methods to import data
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								#### Get data from this computer
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								Click on `Browse…` and select a file on your hard drive. All files will be shown, not just compatible ones.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								If you import an archive file (something with the extension `.zip`, `.tar.gz`, `.tgz`, `.tar.bz2`, `.gz`, or `.bz2`), OpenRefine detects the most common file extension in it and loads all files with that extension into a single project.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								#### Web Addresses (URLs)
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								Type or paste the URL to the data file into the field provided. You can add as many fields as you want. OpenRefine will download the file and preview it for you.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								If you supply two or more file URLs, OpenRefine will identify each one and ask you to choose which (or all) to load.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								Do not use this form to load a Google Sheet by its link; use the Google Data form instead.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								#### Clipboard
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								You can copy and paste in data from anywhere. OpenRefine will recognize comma-separated, tab-separated, or table-formatted information copied from sources such as word-processing documents, spreadsheets, and tables in PDFs. You can also just paste in a list of items that you want to turn into multi-column rows. OpenRefine recognizes each new text line as a row.
 								This can be useful if you want to pre-select a specific number of rows from your source data, or paste together rows from different places, rather than delete unwanted rows later in the project interace.
 								This can also be useful if you would like to paste in a list of URLs, which you can use later to fetch the data online and build columns with.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								#### Database (SQL)
-												Simplifying the database instructions

And, because drafting is never done, tweaks to the installation section.

											
										
										
											2020-07-21 23:11:49 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								You may want to pull the latest dataset directly from an online database. This could include an online catalogue, a Wordpress or similar content management system, or a [digital repository or collection management system](https://bits.ashleyblewer.com/blog/2017/08/09/collection-management-system-collection/).
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								OpenRefine can connect to PostgreSQL, MySQL, MariaDB, and SQLite database systems. It will automatically populate the “Port” field based on which of these you choose, but you can manually edit this if needed.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								If you have a downloaded database (`.db`) file, you can supply the path to the file on your computer directly in the “Database” field at the bottom of the form. You can leave the rest of the fields blank.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
 								To import data directly from a database online, you will need to do two things:
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
 								*   Add OpenRefine (running from your computer) to an account authorized to access your database
 								*   Set up OpenRefine to access that database using that account
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
 								![A screenshot of connecting to a database](img/databaseconnect.jpg "A screenshot of connecting to a database.")
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								Log in to your hosting provider. Get the database type (such as MySQL), database name, and the URL (either an IP address, such as `127.0.0.1`, or the domain that uses the database). Then look at the accounts authorized for access. You may wish to create a new account just for OpenRefine, or add OpenRefine to an existing account.
 								Each host will have a slightly different method, but generally speaking: look for “accounts with access” to the database you wish to authorize, and within the settings for that account, look for “allowable hosts” or “access hosts.”
 								In that list, add the IP address of your own computer, because that is where the OpenRefine access request will be coming from. You can find this easily, by clicking “Test” within OpenRefine once the rest of the information is filled out: OpenRefine will give you an error that looks like
 								``` error:Access denied for user 'yourusername'@'123-45-67-89.yourISP.com' ```
 								Take your IP address from this error message and put that, with periods instead of hyphens, into the “allowable hosts” field on the account you’re trying to use. Add a wildcard to the end of your IP address (“123.45.67.89%”). Save that setting, and then test the connection again with OpenRefine.
 								You can either connect just once and gather data, or save the connection to use it again later. If you press “Connect” without saving, OpenRefine will forget all the information you just entered.
 								If you’d like to save the connection, name your connection in a way you will recognize later. Click “Save” and it will appear in the “Saved Connections” list on the left. Now, click on the “...” ellipsis to the right of the connection you’ve saved, and click “Connect.”
 								If your connection is successful, you will see a Query Editor. From here you can write an [SQL query](https://www.w3schools.com/sql/) to pull the specific data you need.
 								If you need help, you may be able to find instructions from your hosting provider. Here are the guides from:
 								*   [Dreamhost](https://help.dreamhost.com/hc/en-us/articles/214883058-How-do-I-connect-to-my-database-using-a-third-party-program-)
 								*   [GoDaddy](https://ca.godaddy.com/help/connect-remotely-to-a-mysql-database-in-my-linux-hosting-account-16103)
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
 								#### Google Data
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								You have two ways to load in data from Google Sheets:
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
+								*   A link to an accessible Google Sheet (that is, one with link-sharing turned on)
 								*   Selecting a Google Sheet in your Google Drive
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								##### Google Sheet by URL
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								You can import data from any Google Sheet that has link-sharing turned on. Paste in a URL that looks something like
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								```https://docs.google.com/spreadsheets/……….../edit?usp=sharing```
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								This will only work with Sheets, not with any other Google Drive file that might have an available link.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
 								##### Google Sheet from Drive
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								You can authorize OpenRefine to access your Google Drive data and import data from any Google Sheet it finds there. This will include Sheets that belong to you and Sheets that are shared with you, as well as Sheets that are in your trash.
 								OpenRefine will not show spreadsheets that are in your email inbox or stored in any other Google property - only in Drive. It also won’t show all compatible file formats, only Sheets files.
 								OpenRefine will generate a list of all Sheets it finds, with the most recently modified Sheets at the top. If a file you’ve just added isn’t showing in this list, you can close and restart OpenRefine, or simply navigate to an existing project, open it, then head back to the “Create Project” window and check again.
 								When you click “Preview” the Sheet will open in a new browser tab. When you click the Sheet title, OpenRefine will begin to process the data.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								### Previewing the project
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								Once OpenRefine is ready to import the data, you will see a screen with “Configure Parsing Options” at the top. You’ll see a preview of the first 100 rows and all identified columns.
 								At the bottom of the screen you will find options for telling OpenRefine how to process what it has found. You can tell it which row(s) to parse as column headers, as well as to ignore any number of rows at the top. You can also select a specific range of rows to work with, by discarding some rows at the top (excluding the header) and limiting the total number of rows it loads.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								OpenRefine tries to guess how to parse your data based on the file extension. For example, .xml files are going to be parsed as though they are formatted in XML. An unknown file extension (or your clipboard copy-paste) is assumed to be either tab-separated or comma-separated. OpenRefine looks for a tab character; if one is found, it assumes you have imported tab-separated data.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								If Google isn’t certain what format you imported, it will provide a list of possibilities under “Parse data as” and some settings. You can specify a custom separator now, or split columns later on in the project interface.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								If you imported a spreadsheet with multiple worksheets, they will be listed along with the number of rows they contain. You can only select data from one worksheet.
 								Note that OpenRefine does not preserve any formatting, such as cell or text colour, that my have been in the original data file.
 								You should create a project name at this stage. You can also supply tags to keep your projects organized. When you’re happy with the preview, click “Create Project.”
 								## Import a project
 								Because OpenRefine only runs locally on your computer, you can’t have a project accessible to more than one person at the same time.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								The best way to collaborate with another person is to export and import projects that save all your changes, so that you can pick up where someone else left off. You can also [export projects](export.md) and import them to new computers of your own, such as for working on the same project from the office and from home.
 								An exported project will include all of the [history](interface/projectscreen.md#history), so you can see (and undo) all the changes from the previous user. It is essentially a point-in-time snapshot of their work. OpenRefine only exports projects as `.tar.gz` files at this time.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								### Instructions
 								Once someone has sent you a project archive file from their computer, you can save it anywhere, including your Downloads folder.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								In the left-hand menu of the home screen, click “Import Project.” You will see the following screen:
 								![A screenshot of the Import screen](img/import.jpg "A screenshot of the Import screen.")
 								Click “Browse…” and navigate to wherever you saved the file you were sent (for example, your Downloads folder).
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								You can rename the project if you’d like - we recommend adding your name, a date, or a version number, if you’re planning to continue collaborating with another person (or working from multiple computers).
 								Then, click “Import Project.”  Your project should appear with a step count beside “undo/redo” if steps were saved by the exporter.
 								OpenRefine will store the project in its own workspace directory, so you can now delete the original file that was sent to you.
 								## Project management
 								You can access all of your created projects by clicking on “Open Project.” Your project list can be organized by modification date, title, row count, and other metadata you can supply (such as subject, descripton, tags, or creator). To edit the fields you see here, click “About” to the left of each project. There you can Edit a number of available fields. You can also see the project ID that corresponds to the name of the folder in your work directory.
 								### Naming projects
 								You may have multiple projects from the same dataset, or multiple versions from sharing a project with another person. OpenRefine automatically generates a project name from the imported file, or “clipboard” when you use Clipboard importing. Project names don’t have to be unique, so OpenRefine will create many projects with the same name unless you intervene.
 								You can name a project when you create it or import it, and you can rename a project by opening it and clicking on the project name at the top of the screen.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
+								### Autosaving
 								OpenRefine saves all of your actions (everything you can see in the Undo/Redo panel). That  includes flagging and starring rows.
 								It doesn’t, however, save your facets, filters, or any kind of “view” you may have in place while you work. This includes the number of rows showing, whether you are showing your data as rows or records, and any sorting or column collapsing you may have done. A good rule of thumb is: if it’s not showing in Undo/Redo, you will lose it when you leave the project workspace.
 								You can only save and share facets and filters, not any other type of view. To save current facets and filters, click “Permalink.” The project will reload with a different URL, which you can then copy and save elsewhere. This permalink will save both the facets and filters you’ve set, and the settings for each one (such as sorting by count rather than by name).
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
+								### Deleting projects
 								You can delete projects, which will erase the project files from the work directory on your computer. This is immediate and cannot be undone.
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
 								Go to “Open Project” and find the project you want to delete. Click on the X to the left of the project name. There will be a confirmation dialog.
-												Adding mostly-empty docs (#2892)

Skeleton and initial content for documentation
											
										
										
											2020-07-07 21:16:27 +02:00
 								### Project files
-												Update starting.md

I am not at all clear on how well GitHub for Windows will handle multiple PRs pointing to the same work directory, but let's see what happens.

											
										
										
											2020-07-20 17:49:15 +02:00
+								You can find all of your raw project files in your work directory. They will be named according to the unique Project ID that OpenRefine has assigned them, which you can find on the “Open Project” screen, under the “About” button for each project.
 								You can point OpenRefine to use another work directory using [Preferences](running.md#preferences).