Category "pandas"

Maintaining the order of the elements in a frozen set

I have a list of tuples, each of which contains one string and two integers. The list looks like this: x = [('a',1,2), ('b',3,4), ('x',5,6), ('a',2,1)]

How to sort Plotly bar chart in descending order

I have created a basic bar chart in plotly that I would like to sort by descending order. I couldn't find an easy way to specify this in the plotly syntax, so

Melting pandas data frame with multiple variable names and multiple value names

How can I melt a pandas data frame using multiple variable names and values? I have the following data frame that changes its shape in a for loop. In one of the

ERR_CONNECTION_REFUSED on browser when opening dtale with Eclipse Pydev

Opening a dtale sheet using Eclipse Pydev on Windows leads to ERR_CONNECTION_REFUSED in the browser. The same code works in Spyder and Jupyter, however. I know dtale

Python Pandas - Concat dataframes with different columns ignoring column names

I have two pandas.DataFrames which I would like to combine into one. The dataframes have the same number of columns, in the same order, but have column headings

Python : Changing the original data using a for loop

I have some really big txt files (> 2 GB) where the quality of the data is not good. In some columns (that should be integer), for values below 1000.00, '.'

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x96 in position 35: invalid start byte

I am new to Python and am trying to read a CSV file using the script below: Past = pd.read_csv("C:/Users/Admin/Desktop/Python/Past.csv", encoding='utf-8') But, getting

How to split a DataFrame based on consecutive index?

I have a DataFrame 'work' with a non-consecutive index. Here is an example: Index Column1 Column2 4464 10.5 12.7 4465 11.3 12.8 4466 10.3 22.8 5123 1

Getting alternating results with pandas melt

I was trying to convert the first image in this album into the second with pandas but all I got was the third one... Original Year Jan Feb Mar A

Passing dataframe and using its name to create the csv file

I have a requirement where I need to pass different dataframes and print the rows in the dataframes to a csv file, and the name of the file needs to be the datafram

Slice DataFrame using Dates from a List and its offset as a range for slicing in a for loop

This is my DataFrame df, with calendar-day frequency and a DateTime object as the index. The data starts from 1989-01-03 and runs to the present day: Pri

pandas dataframe replace blanks with NaN

I have a dataframe with empty cells and would like to replace these empty cells with NaN. A solution previously proposed on this forum works, but only if the ce

How to reorder indexed rows based on a list in Pandas data frame

I have a data frame that looks like this: company Amazon Apple Yahoo name A 0 130 0 C 173 0 0 Z 0 0

Find the column name which has the maximum value for each row

I have a DataFrame like this one: In [7]: frame.head() Out[7]: Communications and Search Business General Lifestyle 0 0.745763 0.050847 0.118644

Count most frequent 100 words from sentences in Dataframe Pandas

I have text reviews in one column of a pandas DataFrame and I want to count the N most frequent words with their frequency counts (in whole column - NOT in single

How To Solve KeyError: u"None of [Index([..], dtype='object')] are in the [columns]"

I'm trying to create an SVM model from what I found on GitHub here, but it keeps returning this error. Traceback (most recent call last): File "C:\Users\Me\Do

'function' object has no attribute 'apply'

I have a data frame df, which has a column 'query' containing text data. I am trying to clean the text data with the apply function. But getting the above er

Filter rows in csv file based on another csv file and save the filtered data in a new file

Good day all. I was trying to filter file2 based on file1, where file1 is a subset of file2. But file2 has a description column that I need to be able to an

In Pandas, how to return the id for the next value which is above/below a threshold

I have a dataframe like this: date value 0 2018-05-15 06:00:00 100.86 1 2018-05-15 07:00:00 101.99 2 2018-05-15 08:00:00 110.00 3 201