Category "pandas-groupby"

Summing row values after a groupby but based on a dictionary condition?

I am trying to figure out how to add row entries of the numeric columns(supply,demand) . I am at a complete loss. My initial thoughts are to do this with a dic

Pandas to read a excel file from s3 and apply some operation and write the file in same location

i am using pandas to read an excel file from s3 and i will be doing some operation in one of the column and write the new version in same location. Basically ne

Groupby id and change values for all rows for the earliest date to NaN

I have the following id, i would like to groupby id and then replace value X with NaN. My current df. ID Date X other variables.. 1 1/1/18

Pandas: Values to columns and then group and merge by same Id [duplicate]

I have a dataframe like this df = DataFrame({'Id':[1,2,3,3,4,5,6,6,6], 'Type': ['T1','T1','T2','T3','T2','T1','T1','T2','T3'],

Pandas DataFrame : How to groupby and sort "by blocks"?

I'm working with a DataFrame containing data as follows, and group the data two different ways. >>> d = { "A": [100]*7 + [200]*7, "B": ["one"

Pandas pick the higher value for each unique id

I have a df of customers CUST_ID | SEGMENT | AREA 1 | B | CAD 1 | A | RAM 2 | B | CAD 2 | C | RAM 3 | B

Pandas groupby feature question for output CSV

I have the following code df.groupby('AccountNumber')[['TotalStake','TotalPayout']].sum() which displays as I would like it to in pandas The issue is when I ou

Pandas - Cross referencing with DatetimeIndex - Groupby

I have data of many companies by month (End of Month). I want to create a new columns with groupby for each company where: new_col from Jul of this year to Jun

Calculate Mean Absolute Error for each row of a Pandas dataframe

Below is a sample of pandas dataframe that I'm working with. I want to calculate mean absolute error for each row but only considering relevant columns for valu

Groupby hours +/- some integer of additional hours

I have a data frame consisting of some columns, where the index is datetime, i.e. it looks something like: df = col1 col2

Extracting specific number of rows from dataframe

I have a csv file having two columns i.e. imagename and ID. There are multiple image names for same ID as shown in picture. Number of image names against id is

Error in creating dynamic columns from existing column having nested list of lists

I want to create two column from an existing column which contains nested list of list as values. Rows of record consisting of 3 companies participant and their

Calculations on a pandas DataFrame column conditional on another column

I notice several 'set value of new column based on value of another'-type questions, but from what I gather, I have not found that they address dividing values

changing frequency in a pandas SeriesGroupBy

I'm struggling to find a simple way to change a frequency of a pd.Series that is grouped on some level of a pd.MultiIndex (so it's a pd.core.groupby.generic.Ser

Multiple aggregations of the same column using pandas GroupBy.agg()

Is there a pandas built-in way to apply two different aggregating functions f1, f2 to the same column df["returns"], without having to call agg() multiple times

Adding new column based on combined criteria in Pandas Groupby

Following on from my previous question (thanks to those responding) I'm stuck again in achieving what I suspect is possible using a groupby in Pandas. Here's wh

generate dict from datarame with grouping columns

I try to generate a json file or dict rom my datframe (grouping the columns) my datFrame is df1 = pd.DataFrame({ 'USER': ['ALL','ALL','BOB','STEVE',

generate dict from datarame with grouping columns

I try to generate a json file or dict rom my datframe (grouping the columns) my datFrame is df1 = pd.DataFrame({ 'USER': ['ALL','ALL','BOB','STEVE',

Add a new logic in pyhton

Want to add logic that calculates and outputs truckloads able to be built each day. Still want this broken out by ship-to party (so 1 ship-to party per shipment

How to divide a groupby Object by pandas Series efficiently? Or how to convert yfinance multiple ticker data to another currency?

I am pulling historical price data for the S&P500 index components with yfinance and would now like to convert the Close & Volume from USD into EUR. Thi