Category "statistics"

Random document in Elasticsearch

Is there a way to get a truly random sample from an Elasticsearch index, i.e. a query that retrieves any document from the index with probability 1/N (where N i
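
For a question like this, one commonly suggested approach is a `function_score` query with `random_score`, which gives every matching document a random score so that the top hit is a random pick. The sketch below only builds the request body as a Python dict. The helper name `random_document_query` is hypothetical, and whether the resulting pick is exactly uniform (1/N) is an assumption to check against the Elasticsearch documentation for your version.

```python
import json


def random_document_query(size: int = 1) -> dict:
    """Build a request body that ranks all documents by a random score.

    Hypothetical helper for illustration; it does not contact a cluster.
    """
    return {
        "size": size,  # number of randomly ranked documents to return
        "query": {
            "function_score": {
                "query": {"match_all": {}},  # consider every document
                "random_score": {},          # replace relevance with a random score
            }
        },
    }


body = random_document_query()
print(json.dumps(body, indent=2))
```

The printed JSON could be sent as the body of a `_search` request against the index in question.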

How to properly remove redundant components for Scikit-Learn's DPGMM?

I am using scikit-learn to implement the Dirichlet Process Gaussian Mixture Model: https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/mixture/dp

How to use `Dirichlet Process Gaussian Mixture Model` in Scikit-learn? (n_components?)

My understanding of "an infinite mixture model with the Dirichlet Process as a prior distribution on the number of clusters" is that the number of clusters is d

MCAR Little's test in Python

How can I run Little's test for MCAR in Python? I have looked at the R package for the same test, but I want to do it in Python. Is there an alternate

Does a plug-in selector bivariate kernel density estimator with weights exist for Python?

I am trying to calculate the kernel density estimate for a set of weighted bivariate data points. I am currently using KDEpy.FFTKDE. However, this does not prov

How to show the y-axis of seaborn displot as percentage

I'm using seaborn.displot to display a distribution of scores for a group of participants. Is it possible to have the y axis show an actual percentage (example

Equivalent C# Function For Excel's Norm.S.Inv Function

I plan on finding the benchmark Z's of some data in C#. For this I need the Norm.S.Inv function from Excel. I am not able to find any sort of implementation for
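
For reference, Norm.S.Inv returns the inverse of the standard normal cumulative distribution (mean 0, standard deviation 1). The sketch below shows that computation in Python rather than C#, using the standard library's `statistics.NormalDist`. The function name `norm_s_inv` is hypothetical, and the values it produces may be useful as a cross-check when porting or testing a C# implementation.

```python
from statistics import NormalDist


def norm_s_inv(p: float) -> float:
    """Return z such that P(Z <= z) = p for a standard normal Z.

    p must lie strictly between 0 and 1; otherwise inv_cdf raises
    statistics.StatisticsError.
    """
    return NormalDist(mu=0.0, sigma=1.0).inv_cdf(p)


# The z-value for a cumulative probability of 0.975.
z = norm_s_inv(0.975)
print(round(z, 4))
```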