Data Wrangling

Introduction

Each challenge hosted on the Spectral Platform has a starter training dataset that can be retrieved via Spectral CLI.

Spectral Python SDK provides Data Wrappers, which help facilitate fetching real-time data that can be used to participate in challenges (see Modeler Handbook's Step 7).

Details

  • Fetch the training data through the Spectral SDK

  • If required and deemed suitable, perform any data processing and/or additional feature engineering operations on top of the provided training dataset

  • [Optional] The Modeler can use any other external data if they wish to. However, it is imperative that:

    • any data other than that provided by the Spectral SDK should be readily available in a timely manner during the Consumption Window

    • modelers retain all the necessary scripts pertaining to data fetching, feature engineering, feature selection, etc. as they will be required during Model Validation and Consumption phases

Last updated