Category "statistics"

How do I pass arguments to srvyr inside of a function?

so I'm using srvyr to calculate survey means of a variable (y) from a survey object, grouping by a categorical variable (x) from that same survey object, and th

R: T-statistics with subsets

I would like to have a table as ouptut where I have the t-statistics between the difference in means of certain variables and based on two specific subsets of m

lme4 1.1-27.1 error: pwrssUpdate did not converge in (maxit) iterations

Sorry that this error has been discussed before, each answer on stackoverflow seems specific to the data I'm attempting to run the following negative binomial m

Github Repos are not showing the Languages statistics

I have the followings Githubs repos which are not showing the Languages used for the projects. I mean that the following bar that represent the percentage of di

Get chi squared from cartesian points

I have a table with x, y columns x y 10 12 23 24 12 43 34 15 74 18 ... How can I know if they have a random distribution or if there is a dependency bet

Why is the mean = 0 to calculate the confidence intervals of a distribution when using stats.norm?

I somewhat understand how to calculate the Confidence interval in this manner but why is it that in this code, they used the mean=0 within the stats.norm se = n

Calculate Decay Rate in Python

I have dataset which somewhat follows an exponentional decay df_A Period Count 0 1600 1 894 2 959 3 773 4 509 5 206 I want

ClassValuePicker in Weka KnowledgeFlow doesn't seem to let you pick a class

Does anyone have any insight? New to using Weka KnowledgeFlow but I've followed the official documentation to a t. I'm trying to generate multiple ROC plots, wh

How to find the sum given n, 1st score and last score?

Lets say you have a column called "values" and "no.of times". You have thousands of rows but you don't want to scrape the data from the site.. You have to calcu

replacing missing values in r based on adjacent columns value

I nee help in filling out the missing values. The following rule need to be followed when replacing a missing value. If the value is the same on both sides of t

Getting error while generating heatmap in python:ValueError: Must pass 2-d input. shape=()

I am getting errors while generating maps of h3 moments in python for an image file called "image_test". The error apparently is for the shape of the values tha

Trying to replicate figures from Bayesian statistics without tears: A sampling-resampling perspective, but failed

I'm trying to replicate the three figures from the paper Bayesian statistics without tears: A sampling-resampling perspective, which can be fo

Calculating Mean Squared Error with Sample Mean

I was given this assignment, and I'm not sure if i understand the question correctly. We considered the sample-mean estimator for the distribution mean. Anothe

Random proababilty of rolling down to 1 from 1000 [closed]

If I roll a random number between 1 and a 1000, then use this random number as the new range from 1 and new number for example if it rolled 50

Causal Inference where the treatment assignment is randomised

I have mostly worked with Observational data where the treatment assignment was not randomized. In the past, I have used PSM, IPTW to balance and then calculate

fisher.test crash R with * caught segfault * error

As title said, fisher.test crash R with *** caught segfault *** error. Here is the code to produce the error: d<-matrix(c(1,0,5,2,1,90,0,0,0,1,0,14,0,0,0,0,0

How to plot correlation matrix/heatmap with categorical and numerical variables

I have 4 variables of which 2 variables are nominal (dtype=object) and 2 are numeric(dtypes=int and float). df.head(1) OUT: OS_type|Week_day|clicks|avg_app_s

How to get multiple combinations of multiple lists in python (Multiple n Choose K or nCr)

I have been looking on google and stack overflow for a few hours and I am sure there is an answer for what this is mathematically or perhaps it is just what the

Statistical difference between linear regressions

I have a statistical question on which I am stuck: Imagine you have 5 corn fields. You know the number of corn plant there is in each fields. You know want to c

why a specific model is not appropriate, given a data with 6 variables (they are chr variables)

i want to show why a specific model is not appropriate, given a data with 6 variables (they are chr variables) the model is y= abc*(x1+x2) a and b from the data

Category "statistics"

Other Categories