How to read data from an XML file in Python

Question

I have the following XML file:

<?xml version="1.0" encoding="iso-8859-1" standalone="yes"?>
<rootnode>
  <TExportCarcass>
    <BodyNum>6168</BodyNum>
    <BodyWeight>331.40</BodyWeight>
    <UnitID>1</UnitID>
    <Plant>239</Plant>
    <pieces>
      <TExportCarcassPiece index="0">
        <Bruising>0</Bruising>
        <RFIDPlant></RFIDPlant>
      </TExportCarcassPiece>
      <TExportCarcassPiece index="1">
        <Bruising>0</Bruising>
        <RFIDPlant></RFIDPlant>
      </TExportCarcassPiece>
    </pieces>
  </TExportCarcass>
  <TExportCarcass>
    <BodyNum>6169</BodyNum>
    <BodyWeight>334.40</BodyWeight>
    <UnitID>1</UnitID>
    <Plant>278</Plant>
    <pieces>
      <TExportCarcassPiece index="0">
        <Bruising>0</Bruising>
        <RFIDPlant></RFIDPlant>
      </TExportCarcassPiece>
      <TExportCarcassPiece index="1">
        <Bruising>0</Bruising>
        <RFIDPlant></RFIDPlant>
      </TExportCarcassPiece>
    </pieces>
  </TExportCarcass>
</rootnode>

I am using Python's lxml module to read data from the XML file like this:

from lxml import etree

doc = etree.parse('file.xml')

memoryElem = doc.find('BodyNum')
print(memoryElem)

But it only prints None instead of 6168. What am I doing wrong?

Rakesh · Accepted Answer · 2019-12-03 12:49:26Z

2

You need to iterate over each TExportCarcass element and then use find to access its BodyNum child.

Example:

from lxml import etree

doc = etree.parse('file.xml')
for elem in doc.findall('TExportCarcass'):
    print(elem.find("BodyNum").text)

Output:

6168
6169

or

print([i.text for i in doc.findall('TExportCarcass/BodyNum')]) #-->['6168', '6169']

moebius · Accepted Answer · 2019-12-03 13:01:58Z

2

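When you call find with a plain tag name, it only searches the direct children of the element it is called on (for a parsed document, that is the root element). BodyNum sits one level deeper, inside TExportCarcass, so find returns None. You can instead pass an XPath-style path to find to search for the element at any depth in the document: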

To get the first element only:

from lxml import etree
doc = etree.parse('file.xml')

memoryElem = doc.find('.//BodyNum')
memoryElem.text
# 6168

To get all elements:

[ b.text for b in doc.iterfind('.//BodyNum') ]
# ['6168', '6169']
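
The difference between the two lookups can be seen side by side in a minimal sketch. It assumes a trimmed copy of the question's XML inlined as a string, and it uses the standard-library xml.etree.ElementTree rather than lxml so that it runs without extra installs; find and iterfind behave the same way for these paths.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the XML from the question, inlined so the sketch runs on its own.
XML = """<rootnode>
  <TExportCarcass>
    <BodyNum>6168</BodyNum>
  </TExportCarcass>
  <TExportCarcass>
    <BodyNum>6169</BodyNum>
  </TExportCarcass>
</rootnode>"""

root = ET.fromstring(XML)

# A plain tag name only matches direct children of <rootnode>.
direct = root.find("BodyNum")

# ".//" searches all descendants; find returns the first match.
nested = root.find(".//BodyNum")

# iterfind yields every match in document order.
all_nums = [elem.text for elem in root.iterfind(".//BodyNum")]

print(direct)        # None
print(nested.text)   # 6168
print(all_nums)      # ['6168', '6169']
```

Since find returns None when nothing matches, it is worth checking the result before reading .text from it.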

O Yahya · Accepted Answer · 2019-12-03 13:03:05Z

2

1 - Use / to specify the tree level of the element you want to extract

2 - Use .text to extract the text content of the element

from lxml import etree

doc = etree.parse('file.xml')
memoryElem = doc.find("*/BodyNum")  # * matches any child of the root; BodyNum sits one level below it
print(memoryElem.text)  # .text returns the element's text content, not its tag name

Faizan Naseer · Accepted Answer · 2019-12-03 12:48:07Z

0

Just use Python's built-in xml.etree.ElementTree module:

https://docs.python.org/3/library/xml.etree.elementtree.html
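
As a rough illustration of that suggestion, the sketch below reads each TExportCarcass into a dictionary using only the standard library. The helper name read_carcasses and the choice of fields are assumptions made for the example, and the XML is a shortened copy of the question's data inlined as a string; with a real file you would obtain the root via ET.parse('file.xml').getroot() instead.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the question's data, inlined so the sketch is self-contained.
XML = """<rootnode>
  <TExportCarcass>
    <BodyNum>6168</BodyNum>
    <BodyWeight>331.40</BodyWeight>
    <Plant>239</Plant>
    <pieces>
      <TExportCarcassPiece index="0"><Bruising>0</Bruising></TExportCarcassPiece>
      <TExportCarcassPiece index="1"><Bruising>0</Bruising></TExportCarcassPiece>
    </pieces>
  </TExportCarcass>
  <TExportCarcass>
    <BodyNum>6169</BodyNum>
    <BodyWeight>334.40</BodyWeight>
    <Plant>278</Plant>
    <pieces>
      <TExportCarcassPiece index="0"><Bruising>0</Bruising></TExportCarcassPiece>
    </pieces>
  </TExportCarcass>
</rootnode>"""


def read_carcasses(root):
    """Hypothetical helper: collect one dict per TExportCarcass element."""
    records = []
    for carcass in root.findall("TExportCarcass"):
        records.append({
            # findtext returns the text of the first matching child
            "BodyNum": carcass.findtext("BodyNum"),
            "BodyWeight": float(carcass.findtext("BodyWeight")),
            "Plant": carcass.findtext("Plant"),
            # attribute values are read with .get()
            "piece_indexes": [
                piece.get("index")
                for piece in carcass.findall("pieces/TExportCarcassPiece")
            ],
        })
    return records


records = read_carcasses(ET.fromstring(XML))
for record in records:
    print(record["BodyNum"], record["BodyWeight"], record["piece_indexes"])
```

The same paths also work with lxml's etree, so the loop can be reused unchanged if lxml is already in use.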

RomanPerekhrest · Accepted Answer · 2019-12-03 12:51:18Z

0

Your document contains multiple BodyNum elements.
You need to put an explicit limit into the query if you need only the first element.

Use the following flexible approach based on an XPath query:

from lxml import etree

doc = etree.parse('file.xml')
memoryElem = doc.xpath('(//BodyNum)[1]/text()')
print(memoryElem)   # ['6168']

4 Comments

S Andrew Over a year ago

Is it possible to get the number of TExportCarcass elements?

S Andrew Over a year ago

Sure thanks, I thought we can use comment section to ask for extra information.

RomanPerekhrest Over a year ago

@SAndrew, are you sure this approach deserves such a silly downvote?

moebius Over a year ago

This is also a valid answer. Not sure why it was downvoted
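
On the question raised in the comments about counting TExportCarcass elements, one possible approach is to take the length of the list that findall returns. The sketch below assumes a trimmed inline copy of the XML and uses the standard-library parser; with lxml, doc.xpath('count(//TExportCarcass)') should give the same number as a float.

```python
import xml.etree.ElementTree as ET

# Trimmed inline copy of the question's XML (two carcass records).
XML = """<rootnode>
  <TExportCarcass><BodyNum>6168</BodyNum></TExportCarcass>
  <TExportCarcass><BodyNum>6169</BodyNum></TExportCarcass>
</rootnode>"""

root = ET.fromstring(XML)

# findall returns a list of the matching direct children, so len() gives the count.
carcass_count = len(root.findall("TExportCarcass"))
print(carcass_count)  # 2
```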
