How to improve Python's code speed - Pandas DataFrame

I'm new to Python and pandas. I have a huge DataFrame of 2,000,000 rows and 2 columns (small example below):

RECEIVER	TRANSMITTER
UEXRT4E	458ERT56
URTU5FE	458ERT57

And another one with 304 receivers:

RECEIVER	Number Received
UEXRT4E	25002
URTU5FE	15004
UFTX5FE	10500

I want to delete every transmitter received by each receiver from the huge DataFrame.

To do so I have 2 while loops running. The first one gets all the transmitters received by receiver n°1, then n°2, etc.

The second one deletes all the rows where those transmitters appear.

Variable "u" starts at 0.

    while u < VCbrutD:
        uE = CbrutD['TRANSMITTER'].iloc[u]  # get the u-th transmitter
        Cbrut.drop(Cbrut[Cbrut['TRANSMITTER'] == uE].index, inplace=True)  # delete every row with that transmitter
        u = u + 1
        print(u)

VCbrutD is the number of rows in CbrutD, the result of the previous query where I get all the transmitters for receiver n°1 in my list of 304 receivers.

Cbrut is my huge DataFrame with 2,000,000 rows.

The first loop doesn't take too much time to get all the transmitters for one receiver, but unfortunately the second while loop takes ages to compute.
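
For illustration, the row-by-row deletion can probably be expressed as a single boolean mask with `Series.isin`, which avoids calling `drop` once per transmitter. This is only a minimal sketch: the sample rows are made up, and it assumes `CbrutD` holds the rows of `Cbrut` belonging to one receiver.

```python
import pandas as pd

# Hypothetical small stand-in for the real 2,000,000-row frame
Cbrut = pd.DataFrame({
    "RECEIVER": ["UEXRT4E", "URTU5FE", "UEXRT4E", "UFTX5FE"],
    "TRANSMITTER": ["458ERT56", "458ERT57", "458ERT58", "458ERT56"],
})

# Assumed result of the first loop: all rows for receiver n°1
CbrutD = Cbrut[Cbrut["RECEIVER"] == "UEXRT4E"]

# True for every row whose transmitter was seen by that receiver
mask = Cbrut["TRANSMITTER"].isin(CbrutD["TRANSMITTER"])

# Keep only the rows that do NOT match, in one pass over the frame
Cbrut = Cbrut[~mask]
print(Cbrut)
```

The mask is computed once for the whole column, so the cost no longer grows with the number of transmitters times the number of rows.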

Moreover, I do not want to delete everything at once, because it is important to start from the receiver with the highest number received, then the second one, etc.

Any idea how to improve this loop?

Thanks everyone!
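
To make the ordering requirement concrete, here is a rough sketch that walks the receivers from the highest "Number Received" to the lowest and removes each one's transmitters before moving on. The frame `counts`, the dict `assigned`, and the sample rows are hypothetical names and data chosen for the example; only the column names come from the tables above.

```python
import pandas as pd

# Hypothetical sample of the big frame
Cbrut = pd.DataFrame({
    "RECEIVER": ["UEXRT4E", "UEXRT4E", "URTU5FE", "URTU5FE", "UFTX5FE"],
    "TRANSMITTER": ["458ERT56", "458ERT57", "458ERT57", "458ERT58", "458ERT58"],
})

# Hypothetical version of the 304-receiver table (unsorted on purpose)
counts = pd.DataFrame({
    "RECEIVER": ["URTU5FE", "UEXRT4E", "UFTX5FE"],
    "Number Received": [15004, 25002, 10500],
})

# Receivers ordered from highest to lowest number received
order = counts.sort_values("Number Received", ascending=False)["RECEIVER"]

assigned = {}  # receiver -> transmitters still present when its turn came
for receiver in order:
    # Transmitters this receiver still has in the remaining rows
    transmitters = Cbrut.loc[Cbrut["RECEIVER"] == receiver, "TRANSMITTER"].unique()
    assigned[receiver] = list(transmitters)
    # Remove those transmitters for every receiver before the next turn
    Cbrut = Cbrut[~Cbrut["TRANSMITTER"].isin(transmitters)]

print(assigned)
```

The outer loop still runs once per receiver (304 iterations in the real data), but each iteration does a vectorized filter instead of one `drop` per transmitter.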

python pandas

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow
