Apr 2, 2021
Hi Li,
Datahub deals with metadata, not the data itself. In your example, your metadata in Json format and XML format will be treated differently. Datahub’s Dataset consists of three parts: platform, dataset-name and origin. Let’s say you save your JSON data as amazon S3 file, so your platform for it would be S3. origin means TEST, PROD OR STAGING. If your XML & JSON dataset shares the same schema (even though JSON doesn’t have a schema), you can differentiate them in InstitutionalMemeory aspect of this Dataset. If you don’t want to differentiate them, you can create two datasets. (Dataset is an entity of Datahub)