'Python Pandas groupby with agg() nth() and/or iloc()

Given this DF:

df = pd.DataFrame({'Col1':['A','A','A','B','B','B','B'] 
                  , 'Col2':['i', 'j', 'k', 'l', 'm', 'n', 'o']
                  , 'Col3':['Apple', 'Peach', 'Apricot', 'Dog', 'Cat', 'Mouse', 'Horse']
                  ,})

df

And then using this code:

df1 = df.groupby('Col1').agg({'Col2':'count', 'Col3': lambda x: x.iloc[2]})
df1

I got this result:

What I would like now:

Being able to make the lambda function 'Col3': lambda x: x.iloc[0] to print('Not enough data') when dealing with error for example if I change "x.iloc[0]" to "x.iloc[3]" which raised an error because there's not enough data in "Col1['A'] compared to "Col1['B']".

!! Don't want to use 'last' because this is a simplified and shortened DF for purpose !!

Solution 1:^[1]

You can try with a slice object which will return empty Series if none value.

df1 = df.groupby('Col1').agg({'Col2':'count',
                              'Col3': lambda x: x.iloc[3:4] if len(x.iloc[3:4]) else pd.NA})

print(df1)

      Col2   Col3
Col1
A        3   <NA>
B        4  Horse

You can save typing with named expression if your Python version is greater than 3.8

df1 = df.groupby('Col1').agg({'Col2':'count',
                              'Col3': lambda x: v if len(v := x.iloc[3:4]) else pd.NA})

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution	Source
Solution 1

'Python Pandas groupby with agg() nth() and/or iloc()

Solution 1:[1]

Sources

Related Questions

Solution 1:^[1]