I have some scientific data and wish to find the best region to fit a straight line in. Theoretically, the data should have a constant gradient but other influe
For an university project i need to create an Android application that collects and aggregates data from various supermarket flyers in order to filter deals by
Currently, i am working on a task where we are scraping pages from web and trying to generate labels for each webpage. For that, we have extracted the text data
I am using the following stored procedures (Oracle Database) to export the output of the query to a CSV file CREATE OR REPLACE PROCEDURE run_query(p_sql IN
Can anyone help me please. I need to find the peak and troughs in a data series but with 2 conditions which are: Peak or trough is peak for minimum of n(days) L
I'm getting an NameError in jupyter notebook even after importing numpy as np. Any idea how to go about it will be appreciated %matplotlib inline %config Inline
Actually, there are total 4 tables invoked in this mapping: Market,Cost, A, B, Read_sourceTB_B-----FIL1------->---------JNR4 \ |
Running on AWS and EMR, Jupyter, Pyspark notebook and trying to install a python package "sparse_dot_topn" version 0.2.9 I'm getting an error I don't understand
I have a variety of behavior data in a big Elasticsearch database, and I'd like to do some analysis. In particular, I want to look at repeat behaviors by the ti
How to compute similarity(percentage) between two matrix/arrays. or find the closest array/matrix to a given array, on the basis of how similar their data value
I have two dataframes, and I am struggling to match the unique ids that I created in df1 to df2 based on 'name' and 'version' values. I need to add a column to
I am working on some analytical work and we need to transform data from one source to another and we are using DBT for transformation purpose. one of the data a
I am working on some analytical work and we need to transform data from one source to another and we are using DBT for transformation purpose. one of the data a
In one column of my dataset the assignment of a record to a phase is listed. Phase I (I), Phase II (I), Phase III (I). Each dataset has an ass
I have a CSV file and I want to extract the element in the first row and 3rd column. How might I go about doing this?
I have a df, A B one six two seven three level five one and a dictionary my_dict={1:"one,two",2:"three,four"} I want to replace df.A with my_di
I have diferent dataframes and need to merge them together based on the date column. If I only had two dataframes, I could use df1.merge(df2, on='date'), to do
I have a system of two first order ODEs, which are nonlinear, and hence difficult to solve analytically in a closed form. I want to fit the numerical solution t
I'm trying to get monthly production using group by after converting the unix column into regular timestamp. Can you please tell how to use group by here in the
I would like to query (SQL with joins) CSV files sitting in a network folder for performing data analysis work. I'm not allowed to move the files out of the net