Category "pandas"

Selecting data from a pandas DataFrame

I have defined a pandas DataFrame, given the number of rows (index) and columns. I perform a series of operations and store the data in such DataFrame. The code

How can I group elements in pandas series based on how many times they repeat?

I have this example_series: 0 False 1 False 2 False 3 False 4 False 5 False 6 False 7 False 8 False 9 False 10 False

Assign outcome from SQL query to column

I have a dataframe (test_df) that looks like this: dq_code dq_sql Results ID_24 select 'A' as B, 'B' as

Python pandas read_fwf strips white space

I am facing an issue using the read_fwf command from the Python library pandas, same as described in this unresolved question I want to read an ascii file conta

Converting a pandas dataframe to multi-index and changing values

I have the following dataframe: d = [{'AX':['Rec=1','POSi=2'], 'AVF1':[], 'HI':['Rec=343', 'POSi=4'], 'version_1':[]}, {'AX':[], 'AVF1':['Rec=4', 'POSi=454'],

Calculate the pair-wise correlation between distinct class pairs over two feature columns and the target variable?

Most similar questions relating to calculating this involve a single correlation value for each feature column, showing how the features in a dataset correlate

Having trouble expanding/normalizing a dataframe column of dictionary values into a dataframe/ other columns

I'm trying to expand a dataframe column of dictionaries into it's own dataframe/other columns. I have already tried using json_normalize, iteration, and list c

Split / Explode a column of dictionaries into separate columns with pandas

I have data saved in a postgreSQL database. I am querying this data using Python2.7 and turning it into a Pandas DataFrame. However, the last column of this dat

Why does dask take long time to compute regardless of the size of dataframe

What is the reason that dask dataframe takes long time to compute regardless of the size of dataframe. How to avoid this from happening ? What is the reason beh

DF with values for Time Intervals

I am trying to make a manual dataframe.. I would like to have a time stamp with a time interval, for example: df1: Time Interval Price 10:00 - 11:00 $15 11:00

using pandas to replace header of dataframe

I have an XYZ file in the following format X[m] Y[m] DensD_1200c[m] 625268.27 234978.67 7.24 625268.34 234978.52 7.24 625268.38

Pandas rolling average of a columns of dates

I'm trying to calculate the rolling average of a column of datetime objects. In my scenario, the input data are the last day below freezing each year for ~100 y

How to iterate over rows of each column in a dataframe

My current code functions and produces a graph if there is only 1 sensor, i.e. if col2, and col3 are deleted in the example data provided below, leaving one col

geopandas doesn't find point in polygon even though it should?

I have some lat/long coordinates and need to confirm if they are with the city of Atlanta, GA. I'm testing it out but it doesn't seem to work. I got a geojson f

Google colab: Read .xlsx file in from Github pandas

from Google Colab, I am trying to create a df from a xlsx file I have on a Github repo. As url I take the permalink from Github, the repo is public and account

How to plot some datasets in pandas based on different thresholds in python

I have a data frame that has 3 columns and I want to plot a line graph based on some thresholds. Here is the data frame date income ratio 0 2022-0

Classify DataFrame rows based on first matching condition

I have a pandas DataFrame, each column represents a quarter, the most recent quarters are placed to the right, not all the information gets at the same time, so

How to export Pandas DataFrame to HTML but without any formatting?

I want to export a DF with Pandas to an HTML formatted table, but I don't want any of the default styling that Pandas does to its tables, and would prefer just

Renaming identical column names in Pandas [duplicate]

I have a cycle_2 df with the following column names: 3ls 3rs 3ls 3rs 3 absolute_cost 3.00 9.40 9.40

Parse Year Week columns to Date

I have a data frame with columns Year and Week that I am trying to parse to date into a new column called Date. import datetime df['Date']=datetime.datetime.fro