5 sources
----------

TID: a unique id for the record (across the complete dataset).
CID: cluster id (records with the same CID are duplicates).
CTID: a unique id within a cluster (two records in the same cluster have the same CID but different CTIDs). CTIDs start at 1 and run up to the cluster size.
SourceID: identifies the source a record belongs to (there are five sources). The sources are deduplicated.
Id: the original id from the source. Each source has its own id format. Uniqueness is not guaranteed! (This field can be ignored.)
number: track or song number in the album.
length: the length of the track.
artist: the performer (artist or band) of the track.
year: date of publication.
language: language of the track.
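
As an illustration of how the id fields relate, the following minimal Python sketch groups records by CID to list the duplicate pairs and checks that the CTIDs in each cluster run from 1 to the cluster size. The rows and the helper names (`duplicate_pairs`, `ctids_are_consistent`) are invented for this example and are not part of the dataset or any accompanying tooling.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical rows, invented for illustration only: (TID, CID, CTID, SourceID)
records = [
    (1, 10, 1, 1),
    (2, 10, 2, 3),
    (3, 11, 1, 2),
    (4, 10, 3, 5),
]


def duplicate_pairs(rows):
    """Return the set of TID pairs whose records share a CID."""
    clusters = defaultdict(list)
    for tid, cid, _ctid, _source in rows:
        clusters[cid].append(tid)
    pairs = set()
    for tids in clusters.values():
        # Every unordered pair inside one cluster is a duplicate pair.
        pairs.update(combinations(sorted(tids), 2))
    return pairs


def ctids_are_consistent(rows):
    """Check that the CTIDs of each cluster are exactly 1..cluster size."""
    clusters = defaultdict(list)
    for _tid, cid, ctid, _source in rows:
        clusters[cid].append(ctid)
    return all(
        sorted(ctids) == list(range(1, len(ctids) + 1))
        for ctids in clusters.values()
    )


if __name__ == "__main__":
    print(sorted(duplicate_pairs(records)))
    print(ctids_are_consistent(records))
```

Pairs derived this way can serve as the ground truth when evaluating a deduplication method. The Id field is left out on purpose, since its uniqueness is not guaranteed.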