Search Results

Search found 1631 results on 66 pages for 'statistics'.

Page 14/66 | < Previous Page | 10 11 12 13 14 15 16 17 18 19 20 21 | Next Page >

How can I structure and recode messy categorical data in R?

- by briandk

I'm struggling with how to best structure categorical data that's messy, and comes from a dataset I'll need to clean. The Coding Scheme I'm analyzing data from a university science course exam. We're looking at patterns in student responses, and we developed a coding scheme to represent the kinds of things students are doing in their answers. A…

Read the article
What is the ratio of Java programmers to C#.net programmers?

- by Vaccano

How many Java Programmers are there to every C# programmer? I have a coworker that says it was 3:1 (3 Java to 1 C#) but it is now more like 2:1 (2 java to 1 C#) Is this valid? Is there somewhere I could go for this info? Edit: This question needs to be a bit more limited in scope. I am referring to US programmers and those who would…

Read the article
mysql/algorithm: Weighting an average to accentuate differences from the mean

- by Sai Emrys

This is for a new feature on http://cssfingerprint.com (see /about for general info). The feature looks up the sites you've visited in a database of site demographics, and tries to guess what your demographic stats are based on that. All my demgraphics are in 0..1 probability format, not ratios or absolute numbers or the like. …

Read the article
How to notice unusual news activity

- by ??iu

Suppose you were able keep track of the news mentions of different entities, like say "Steve Jobs" and "Steve Ballmer". What are ways that could you tell whether the amount of mentions per entity per a given time period was unusual relative to their normal degree of frequency of appearance? I imagine that for a more popular…

Read the article
How to use boost normal distribution classes?

- by David Alfonso

Hi all, I'm trying to use boost::normal_distribution in order to generate a normal distribution with mean 0 and sigma 1. The following code uses boost normal classes. Am I using them correctly? #include <boost/random.hpp> #include <boost/random/normal_distribution.hpp> int main() { boost::mt19937 rng; // I…

Read the article
What is the best Java numerical method package?

- by Bob Cross

I am looking for a Java-based numerical method package that provides functionality including: Solving systems of equations using different numerical analysis algorithms. Matrix methods (e.g., inversion). Spline approximations. Probability distributions and statistical methods. In this case, "best" is defined as a…

Read the article
Randomized experiments in R

- by gd047

Here is a simple randomized experiment. In the following code I calculate the p-value under the null hypothesis that two different fertilizers applied to tomato plants have no effect in plants yields. The first random sample (x) comes from plants where a standard fertilizer has been used, while an "improved" one…

Read the article
What to use to create bar, line and pie charts with javascript compatible with all major browsers?

- by marcgg

I used to work with flot but it doesn't support pie charts so I'm forced to change. I just saw JS Charts, but their documentation is very obscure regarding cross browser compatibility (I need it to be IE6+ compliant :). Also this will be for commercial use, so I'd rather have something that I can use free of…

Read the article
R Question. Numeric variable vs. Non-numeric and "names" function

- by Michael

> scores=cbind(UNCA.score, A.score, B.score, U.m.A, U.m.B) > names(scores)=c('UNCA.scores', 'A.scores', 'B.scores','UNCA.minus.A', 'UNCA.minus.B') > names(scores) [1] "UNCA.scores" "A.scores" "B.scores" "UNCA.minus.A" "UNCA.minus.B" > summary(UNCA.scores) X6.69230769230769 Min. …

Read the article
R selecting duplicate rows

- by Matt

Okay, I'm fairly new to R and I've tried to search the documentation for what I need to do but here is the problem. I have a data.frame called heeds.data in the following form (some columns omitted for simplicity) eval.num, eval.count, ... fitness, fitness.mean, green.h.0, green.v.0, offset.0, green.h.1,…

Read the article
Referal links - how does it work ?

- by oneat

Could you explain me? Because in stats I can find Refelar links and I am curious. How are they made? Is it placed somewhere in HTTP request?

Read the article
Screening (multi)collinearity in a regression model

- by aL3xa

I hope that this one is not going to be "ask-and-answer" question... here goes: (multi)collinearity refers to extremely high correlations between predictors in the regression model. How to cure them... well, sometimes you don't need to "cure" collinearity, since it doesn't affect regression model itself,…

Read the article
Naive Bayesian classification (spam filtering) - Doubt in one calculation? Which one is right? Plz c

- by Microkernel

Hi guys, I am implementing Naive Bayesian classifier for spam filtering. I have doubt on some calculation. Please clarify me what to do. Here is my question. In this method, you have to calculate P(S|W) - Probability that Message is spam given word W occurs in it. P(W|S) - Probability that word…

Read the article
incremental way of counting quantiles for large set of data

- by Gacek

I need to count the quantiles for a large set of data. Let's assume we can get the data only through some portions (i.e. one row of a large matrix). To count the Q3 quantile one need to get all the portions of the data and store it somewhere, then sort it and count the quantile: List<double>…

Read the article
Any C++ library for Johansen co-integration test ?

- by Faraz

Any Ideas ? Will be highly appreciated.

Read the article
how to develop a program to minimize errors in human transcription of hand written surveys

- by Alex. S.

I need to develop custom software to do surveys. Questions may be of multiple choice, or free text in a very few cases. I was asked to design a subsystem to check if there is any error in the manual data entry for the multiple choices part. We're trying to speed up the user data entry process and…

Read the article
Screening (multi)collinearity in a reggresion model

- by aL3xa

I hope that this one is not going to be "ask-and-answer" question... here goes: (multi)collinearity refers to extremely high correlations between predictors in the regression model. How to cure them... well, sometimes you don't need to "cure" collinearity, since it doesn't affect regression model…

Read the article
How do you combine "Revision Control" with "WorkFlow" for R?

- by Tal Galili

Hello all, I remember coming across R users writing that they use "Revision control" (e.g: "Source control"), and I am curious to know: How do you combine "Revision control" with your statistical analysis WorkFlow? Two (very) interesting discussions talk about how to deal with the WorkFlow. But…

Read the article
What's the best way to unit test code that generates random output?

- by Flynn1179

Specifically, I've got a method picks n items from a list in such a way that a% of them meet one criterion, and b% meet a second, and so on. A simplified example would be to pick 5 items where 50% have a given property with the value 'true', and 50% 'false'; 50% of the time the method would…

Read the article
R: Forecast package: Automatic algorithm for composite model involving ETS and AR

- by phanikishan

Hey, I would like to write a code involving automatic selection of a best composite model using ETS as well as autoregressive models. What is the criteria I should base my selection on? Also if I'm using the auto.arima function for deducing number of AR terms and corresponding…

Read the article
Significance in R

- by Gemsie

Ok, this is quite hard to explain, but I'm at a complete loss what to do. I'm a relative newcomer to R and although I can completely admire how powerful it is, I'm not too good at actually using it.... Basically, I have some very contrived data that I need to analyse (it wasn't…

Read the article
Is there a good R API for accessing Google Docs?

- by James Thompson

I'm using R for data analysis, and I'm sharing some data with collaborators via Google docs. Is there a simple interface that I can use to access a R data.frame object to and from a Google Docs spreadsheet? If not, is there a similar API in other languages?

Read the article
Summarising grouped records in a dataframe in R (...again)

- by monch1962

Hello all, (I tried to ask this question earlier today, but later realised I over-simplified the question; the answers I received were correct, but I couldn't use them because of my over-simplification of the problem in the original question. Here's my 2nd attempt...) I have…

Read the article
Summarising grouped records in a dataframe in R

- by monch1962

Hello all, I have a data frame in R that looks like this: > TimeOffset, Source, Length > 0 1 1500 > 0.1 1 1000 > 0.2 1 50 > 0.4 2 25 > 0.6 2 3 > 1.1 1 …

Read the article
Discrete problem of probability theory [closed]

- by calejero

A jury consists of 12 persons each of which has, before the trial started, a probability of 0.4 to vote in favor of the defendant's innocence. During the trial, the lawyer has a probability of 0.6 to change the mind of each juror who was biased against the accused. How likely…

Read the article

< Previous Page | 10 11 12 13 14 15 16 17 18 19 20 21 | Next Page >