Is using strings as an object identifier bad practice?

Question

I am developing a small app for managing my favourite recipes. I have two classes - Ingredient and Recipe. A Recipe consists of Ingredients and some additional data (preparation, etc). The reason i have an Ingredient class is, that i want to save some additional info in it (proper technique, etc). Ingredients are unique, so there can not be two with the same name.

Currently i am holding all ingredients in a "big" dictionary, using the name of the ingredient as the key. This is useful, as i can ask my model, if an ingredient is already registered and use it (including all it's other data) for a newly created recipe.

But thinking back to when i started programming (Java/C++), i always read, that using strings as an identifier is bad practice. "The Magic String" was a keyword that i often read (But i think that describes another problem). I really like the string approach as it is right now. I don't have problems with encoding either, because all string generation/comparison is done within my program (Python3 uses UTF-8 everywhere if i am not mistaken), but i am not sure if what i am doing is the right way to do it.

Is using strings as an object identifier bad practice? Are there differences between different languages? Can strings prove to be an performance issue, if the amount of data increases? What are the alternatives?

jsbueno · Accepted Answer · 2016-02-03 19:44:46Z

3

No - actually identifiers in Python are always strings. Whether you keep then in a dictionary yourself (you say you are using a "big dictionary") or the object is used programmaticaly, with a name hard-coded into the source code. In this later case, Python creates the name in one of its automaticaly handled internal dictionary (that can be inspected as the return of globals() or locals()).

Moreover, Python does not use "utf-8" internally, it does use "unicode" - which means it is simply text, and you should not worry how that text is represented in actual bytes.

answered Feb 3, 2016 at 19:44

jsbueno

113k11 gold badges159 silver badges239 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Luca Fülbier Over a year ago

I always forget how much Python uses dicts internally, good answer! I get that Python3 uses unicode (Python2 had a class for that if im not mistaken), but isn't the actual encoding used UTF-8?

jsbueno Over a year ago

No, for perfomance reasons, the internal encoding of text has to have a fixed size per character. It is "UCS4" encoding, but there are optimizations in place to use a more compact encoding, depending on each string.

Community · Accepted Answer · 2017-05-23 12:31:01Z

1

Python relies on dictionaries for many of its core features. For that reason the pythonic default dict already comes with a quite effective, fast implementation "from factory", decent hash, etc.

Considering that, the dictionary performance itself should not be a concern for what you need (eventual calls to read and write on it), although the way you handle it / store it (in a python file, json, pickle, gzip, etc.) could impact load/access time, etc.

Maybe if you provide a few lines of code showing us how you deal with the dictionary we could provide specific details.

About the string identifier, check jsbueno's answer, he gave a much better explanation then I could do.

edited May 23, 2017 at 12:31

CommunityBot

11 silver badge

answered Feb 3, 2016 at 19:51

Lucas Siqueira

8658 silver badges20 bronze badges

2 Comments

Luca Fülbier Over a year ago

I use the dict quite straight forward. Just {"name": actual_obj, ..} in a factory like object, that supervises all of my objects, so that no object with the same name is created twice.

Lucas Siqueira Over a year ago

Sounds to me like you won't have to worry. In a project of my own I deal with a dictionary of a few thousand entries, each one containing lists and dictionaries, plus a huge multi-lined string for its textual description. The only thing that bothered me was its size on disk, but I manage to solve that with pickle and gzip.

Collectives™ on Stack Overflow

Is using strings as an object identifier bad practice?

2 Answers 2

2 Comments

2 Comments

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

2 Comments

2 Comments

Related