Input sequences contain part of the outputs in CNN-for-Text-Classification #8

ducalpha · 2018-05-30T15:50:20Z

Step [54], "data = [[d.split(':')[1][:-1], d.split(':')[0]] for d in data]", seems to include the sub-category label in the input sequence.
For example, for the data line "DESC:def What is ethology ?", data will be ["def What is ethology ?", "DESC"], so the "def" sub-category becomes part of the input sequence.
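
A minimal reproduction of the behaviour described above, assuming each raw line ends with a newline (which is what the `[:-1]` slice appears to be removing). The one-element `data` list is a stand-in for the real file contents:

```python
# Stand-in for the raw lines read from the dataset file (assumed to end in '\n').
data = ["DESC:def What is ethology ?\n"]

# The expression from step [54], applied unchanged.
parsed = [[d.split(':')[1][:-1], d.split(':')[0]] for d in data]

# The fine-grained label "def" is still the first token of the input text.
print(parsed)
first_token = parsed[0][0].split(' ')[0]
print(first_token)
```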

I suggest a fix:

# Remove the sub-category (first word) and the '?' at the end.
data = [[d.split(':')[1].split(' ', 1)[1][:-2], d.split(':')[0]] for d in data]
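
A slightly more defensive variant of the same idea, offered as a sketch rather than a drop-in replacement. The helper name `parse_line` is hypothetical. It assumes the "COARSE:fine question" layout shown in the example, splits on the first colon only (so a colon inside the question does not truncate it), and removes a trailing '?' only when one is present instead of slicing a fixed number of characters:

```python
def parse_line(line):
    """Return [question, coarse_label] for a 'COARSE:fine question text' line.

    Hypothetical helper; assumes the layout shown in the example above.
    """
    # Split on the first colon only, so colons inside the question survive.
    label, rest = line.strip().split(':', 1)

    # Drop the fine-grained sub-category (the first whitespace-delimited word).
    parts = rest.split(' ', 1)
    question = parts[1].rstrip() if len(parts) > 1 else ''

    # Remove a trailing '?' only if it is actually there.
    if question.endswith('?'):
        question = question[:-1].rstrip()

    return [question, label]


data = ["DESC:def What is ethology ?\n"]
data = [parse_line(d) for d in data]
print(data)
```

Unlike the `[:-2]` slice, this also trims the space left before the '?', so the resulting text differs slightly from the one-liner's output.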
