TypeError: '<' not supported between instances of 'Example' and 'Example' #474

kidman99 · 2018-11-12T04:22:40Z

I got this error when running the following code. Do I need something like operator overloading for "<" here, or is there a workaround?

from torchtext.data import TabularDataset
from torchtext import data
from torchtext.vocab import GloVe

tv_datafields = [("id", None), # we won't be needing the id, so we pass in None as the field
("question_text", TEXT),
("target", LABEL)]

trn = TabularDataset.splits(
path="data/quora", # the root directory where the data lies
train='train.csv',
format='csv',
skip_header=True, # if your csv file has a header, make sure to pass this to ensure it doesn't get processed as data!
fields=tv_datafields)

TEXT.build_vocab(trn, vectors=GloVe(name='6B', dim=300))

tu-artem · 2019-01-15T13:06:15Z

.splits() returns a tuple of datasets; in your case it has length 1. So

trn = TabularDataset.splits(
...
...
...
fields=tv_datafields)[0]

should work here, or you can use the regular TabularDataset constructor instead.
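To make the "tuple of length 1" point concrete, here is a minimal sketch in plain Python. `FakeDataset` and `fake_splits` are hypothetical stand-ins invented for illustration; they are not torchtext code and only mimic the "returns a tuple of datasets" behaviour described above.

```python
# Hypothetical stand-ins, NOT the torchtext implementation.
class FakeDataset:
    def __init__(self, name):
        self.name = name


def fake_splits(train=None, validation=None, test=None):
    # Return one dataset per file name that was given, packed in a tuple.
    names = (train, validation, test)
    return tuple(FakeDataset(n) for n in names if n is not None)


# Without indexing, the variable holds the tuple, not the dataset.
result = fake_splits(train="train.csv")
print(type(result).__name__, len(result))  # tuple 1

# Indexing with [0] pulls the single dataset out of the tuple.
trn = fake_splits(train="train.csv")[0]
print(type(trn).__name__, trn.name)  # FakeDataset train.csv
```

In the original snippet, trn was the tuple rather than the dataset inside it, which is presumably why build_vocab did not receive what it expected.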

cheryllwl · 2019-01-21T09:38:47Z

I had the same problem with TabularDataset too
http://mlexplained.com/2018/02/08/a-comprehensive-tutorial-to-torchtext/
This tutorial was helpful.

added these two lines and it worked like a charm

mttk · 2019-01-31T21:33:22Z

Thanks @cheryllwl, this should be documented properly.

kunjmehta · 2019-10-04T17:36:16Z

@tu-artem Can you please elaborate on what adding the index [0] does?
From what I gather, the splits() method returns a Dataset object as a tuple containing Example objects (instances/rows).
So, if I write:
train, val = torchtext.data.TabularDataset.splits(path='./', train="train.csv", test="test.csv", format='csv', fields=data_fields, skip_header=True)
I will get a Dataset object, which is a tuple containing all training instances, in the train variable, and another Dataset object containing all test instances in the val variable. Am I right?
In that case, please help me understand what the indexing [0] does. Thanks.

tu-artem · 2019-10-04T18:32:45Z

@kunjmehta In your case you are already doing tuple unpacking via multiple assignment (train, val = ...), so you don't need any further indexing.
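A short plain-Python sketch of why unpacking and indexing are interchangeable here. `two_splits` is a hypothetical stand-in that returns a 2-tuple of lists; it is not part of torchtext.

```python
def two_splits():
    # Hypothetical stand-in returning two "datasets" (plain lists here).
    return (["train row 1", "train row 2"], ["test row 1"])


# Multiple assignment unpacks the tuple element by element...
train, val = two_splits()

# ...which gives the same values as indexing the tuple by hand.
both = two_splits()
same = (train == both[0]) and (val == both[1])
print(same)  # True

# A 1-tuple can be unpacked the same way, as an alternative to [0].
(only,) = (["train row 1"],)
print(only)  # ['train row 1']
```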

aaronbriel · 2020-01-25T01:05:53Z

What worked for me was to simply add sort=False, as sorting was not needed in my case.

Sandesh10 · 2020-02-19T22:48:34Z

"What worked for me was to simply add sort=False, as sorting was not needed in my case."

This worked for me too. I added sort=False as a parameter in the BucketIterator.
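For context on why sort=False can make the error go away: the message in the issue title is what Python raises whenever objects without a `<` implementation are sorted directly. The sketch below reproduces that with a hypothetical `Row` class standing in for Example; it assumes the iterator ends up sorting examples without a key, which is not confirmed in this thread.

```python
class Row:
    """Hypothetical stand-in for an Example: holds data, defines no __lt__."""

    def __init__(self, text):
        self.text = text


rows = [Row("a longer question"), Row("short")]

# Sorting the objects directly needs '<' and fails.
try:
    sorted(rows)
    error_message = None
except TypeError as exc:
    error_message = str(exc)
print(error_message)
# '<' not supported between instances of 'Row' and 'Row'

# Supplying a key tells sorted() what to compare instead.
by_length = sorted(rows, key=lambda r: len(r.text))
print([r.text for r in by_length])  # ['short', 'a longer question']
```

If sorting is actually wanted, the analogous option on torchtext's iterators is the sort_key argument (for example, a lambda returning the length of the text field), rather than turning sorting off.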

mttk added the docs label Jan 31, 2019


pytorch / text
