File size: 999 Bytes
5fa1a76 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 |
Remove the other columns with the [~datasets.Dataset.remove_columns] method: minds = minds.remove_columns(["english_transcription", "intent_class", "lang_id"]) Take a look at the example again: minds["train"][0] {'audio': {'array': array([-0.00024414, 0. , 0. , , 0.00024414, 0.00024414, 0.00024414], dtype=float32), 'path': '/root/.cache/huggingface/datasets/downloads/extracted/f14948e0e84be638dd7943ac36518a4cf3324e8b7aa331c5ab11541518e9368c/en-US~APP_ERROR/602ba9e2963e11ccd901cd4f.wav', 'sampling_rate': 8000}, 'path': '/root/.cache/huggingface/datasets/downloads/extracted/f14948e0e84be638dd7943ac36518a4cf3324e8b7aa331c5ab11541518e9368c/en-US~APP_ERROR/602ba9e2963e11ccd901cd4f.wav', 'transcription': "hi I'm trying to use the banking app on my phone and currently my checking and savings account balance is not refreshing"} There are two fields: audio: a 1-dimensional array of the speech signal that must be called to load and resample the audio file. |