Dataset Management

Introduction

Data Management serves as the administrative backbone of the platform where technical governance meets platform usability. The system needs to support multiple datasets, agencies, and user roles.


Each dataset has Metadata and Admin tabs that provide authorized users with the tools needed to provide information about and govern access to each dataset in the system. While most users will interact with data through visualizations or spreadsheets, these two tabs are the interface where stewardship functions take place—ensuring that data is not only available, but properly described, securely controlled, and responsibly maintained.

Overview Tab

Each dataset has an Overview tab that is visible to all users. Administrators gain access to edit controls on this tab.


Dataset Title and Description

At the top of the page, the dataset’s title is prominently displayed along with a detailed description. This description should offer a narrative of what the dataset contains, what it measures, and its intended use. For administrators, these fields are editable and changes will be reflected publicly.


Dataset Information

Beneath, and to the right of, the description, additional dataset information fields are displayed:

  • Source URL - A place to add a direct link to the original data publisher, whether a federal agency, local authority, academic institution, or private data vendor.
  • Publisher Name - Identifies the organization responsible for the dataset’s production and maintenance.
  • Categories - A place to add tags to a dataset.
  • Type - The type of data that was selected in the uploader and should not be changed.
  • Update Interval - The frequency of which the data is updated.


Editing

To edit these fields while logged in as an administrator;


1. Hover over the title or description to display an edit (pencil) icon.

2. Click the edit icon.

3. Enter the title/description.

4. Click the "Save" button to keep (or the "Cancel" button to discard) the changes.


Versions

Versions represent multiple iterations of a dataset uploaded within the same structure. While data can be different from version to version, the columns of the dataset must remain the same or a new source must be created. Versioning allows for the updating of data and longitudinal dataset. Clicking on the Version ID of a version will bring you to an Overview page specific to that version. Here you can create and delete downloads for that version.

Creating and Deleting Downloads

On the Version-specific Overview page (see Versions above);

  1. Click "Create Download".
  2. Check all File Types to be available for download.
  3. Check all columns to be available for download. Note: Uncheck any custom columns.
  4. Click "State Download Creation".
  5. Once the system has finished processing and creating the download, it will be available on the source's Overview page.
Admin Tab

The Admin Tab contains the control system that allows authorized administrators to manage who can view, download, or modify each dataset within the platform. The Admin Tab has four sections;

  • User Access Controls - This section allows administrators to manage individual user accounts within the system. Here, you can assign users to groups, adjust their permission levels, activate or deactivate accounts, and review user roles.
  • Group Access Controls - defines which user groups have access to specific datasets or platform tools.
  • Events - Provides a data upload log.
  • Admin Actions - Links to three administrative actions; Advanced Metadata, Add Version, and Delete.


User and Group Access Controls

Perhaps the most critical component of data management is the Access Control system. This system allows authorized administrators to manage who can view, download, or modify each dataset within the platform. Access can be granted at either a User or Group level. When a dataset is uploaded, only the Administrator that uploaded the data is given permissions to interact with the dataset by default. They can add specific users in the User Access Controls to share the data privately with select user(s). The Group Access Controls can be used to share data with users that exist within specified categories. Groups might include internal teams (e.g., NYSDOT Admin), external partners (e.g., MPOs, planning consultants), or public users.


Each group or user’s access is governed by an authority level, represented by an integer value that determines the extent of their privileges.


Adding Users or Groups:

To grant access to a new User or Group;

1. Click the "Add Group Access" "Add User Access" button. A dropdown menu will appear, listing all currently recognized user groups within the system.

2. Selecting a User or Group from this list adds them to the list of authorized users/groups with the lowest Authority Level "1" or permission to view only.


Removing Users or Groups:

To revoke access, simply click the "Remove" button adjacent to the user or group’s entry in the access list.


Setting Permissions:

Each group’s authority level can be adjusted using an integer value:

  • 10 - Full Access: The group may edit dataset metadata, upload or overwrite data files, and delete the dataset entirely.
  • 2 - View + Download: The group may browse all visualizations and download the raw data but cannot make any changes.
  • 1 - View Only: The group may see the dataset on the platform but cannot download the raw data.

After adjusting authority levels, administrators must click the "Confirm" button to save changes. These permissions are applied immediately.


Note: At the time of this documentation draft, only these three permissions levels exist. Additional levels of permissions may outlined in the future and the documentation will be updated accordingly.


Events

Events provide an upload log for the dataset. The following columns are visible;

ETL Context ID - Unique Identifier

Type - The type of event that occurred.

Source Name - Name of the source.

User - User responsible for the event.

Started - Time since last the event occurred.

Duration - Duration of the event (if applicable)

ETL Status - The status of the event.


Clicking on a row in the the Event section will navigate the user to an expanded page for that task. This is a paginated list of tasks for that upload event that can be used to troubleshoot uploading errors.


Admin Actions

Advanced Metadata

The Advanced Metadata page is the location where Administrators can update the display names for each variable in a dataset. To edit a display name;

1. Hover over the "No Display Name" label to show the edit (pencil) icon.

2. Click the edit icon.

3. Enter the desired display name.

4. Click the "Save" button to keep (or the "Cancel" button to discard) the changes.


Add Version

Datasets can be updated with new versions. The "Add Version" button will navigate the Administrator to a Data Upload page for that source. To add a new version from this page, follow the instructions on the Data Upload section of this guidance documentation.


Delete

This deletes a dataset for all users. This action cannot be undone.