Result 1
Downloading file 'admiralbob77_-_Choice_-_Drum-bass.ogg' from 'https://librosa.org/data/audio/admiralbob77_-_Choice_-_Drum-bass.ogg' to '/root/.cache/librosa'.
audio:
  name: melspectrogram
  sample_rate: 16000
  frame_length: 20.0
  frame_shift: 10.0
  del_silence: false
  num_mels: 80
  apply_spec_augment: true
  apply_noise_augment: false
  apply_time_stretch_augment: false
  apply_joining_augment: false
eval:
  use_cuda: true
  dataset_path: /content/drive/MyDrive/kodata/groups
  checkpoint_path: /content/drive/MyDrive/project/openspeech_fork/outputs/2022-05-07/05-00-18/rnn_transducer-ksponspeech/uzn0w6l3/checkpoints/final.ckpt
  manifest_file_path: /content/drive/MyDrive/kodata/eval.txt
  num_workers: 4
  batch_size: 32
  beam_size: 1
model:
  model_name: rnn_transducer
  encoder_hidden_state_dim: 320
  decoder_hidden_state_dim: 512
  num_encoder_layers: 4
  num_decoder_layers: 1
  encoder_dropout_p: 0.2
  decoder_dropout_p: 0.2
  bidirectional: true
  rnn_type: lstm
  output_dim: 512
  optimizer: adam
tokenizer:
  sos_token: <sos>
  eos_token: <eos>
  pad_token: <pad>
  blank_token: <blank>
  encoding: utf-8
  unit: kspon_character
  vocab_path: ../../../aihub_labels.csv
[2022-05-07 14:20:56,561][torch.distributed.nn.jit.instantiator][INFO] - Created a temporary directory at /tmp/tmpenq7mp8s
[2022-05-07 14:20:56,563][torch.distributed.nn.jit.instantiator][INFO] - Writing /tmp/tmpenq7mp8s/_remote_module_non_sriptable.py
100% 94/94 [42:32<00:00, 27.15s/it]
[2022-05-07 15:03:38,644][__main__][INFO] - Word Error Rate: 1.0047503635482307, Character Error Rate: 0.9588281033778352
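
The run above reports a Word Error Rate slightly above 1.0. That is possible because the error count includes insertions, which are not bounded by the reference length. The sketch below illustrates this using the standard edit-distance definition of WER and CER. It is an illustration only, not the evaluation script's own metric code: the function names are hypothetical, and it assumes whitespace-separated words and a CER computed with spaces removed, which may differ from how the script tokenizes text or aggregates over utterances.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, deletions, insertions)."""
    prev = list(range(len(hyp) + 1))  # distance from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance to an empty hypothesis prefix: i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_error_rate(reference, hypothesis):
    """Edit distance over words divided by the number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def character_error_rate(reference, hypothesis):
    """Edit distance over characters (spaces removed) divided by reference length."""
    ref_chars = list(reference.replace(" ", ""))
    if not ref_chars:
        raise ValueError("reference must contain at least one character")
    return edit_distance(ref_chars, list(hypothesis.replace(" ", ""))) / len(ref_chars)


# Two reference words, three wrong hypothesis words:
# 2 substitutions + 1 insertion = 3 errors over 2 reference words.
print(word_error_rate("a b", "x y z"))  # 1.5
```

A WER near 1.0 together with a CER near 0.96 means almost none of the reference text is being recovered, so a result like this usually deserves a check of the checkpoint, vocabulary file, and manifest paths before it is read as a real measure of model quality.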