R counting the occurrences of similar rows of data frame

Posted by Matt on Stack Overflow See other posts from Stack Overflow or by Matt
Published on 2010-04-03T20:25:14Z Indexed on 2010/04/03 22:43 UTC
Read the original article Hit count: 386

Filed under:

duplicates

I have data in the following format called DF (this is just a made up simplified sample):

eval.num, eval.count, fitness, fitness.mean, green.h.0, green.v.0, offset.0 random
1         1           1500     1500          100        120        40       232342
2         2           1000     1250          100        120        40       11843
3         3           1250     1250          100        120        40       981340234
4         4           1000     1187.5        100        120        40       4363453
5         1           2000     2000          200        100        40       345902
6         1           3000     3000          150        90         10       943
7         1           2000     2000          90         90         100      9304358
8         2           1800     1900          90         90         100      284333

However, the eval.count column is incorrect and I need to fix it. It should report the number of rows with the same values for (green.h.0, green.v.0, and offset.0) by only looking at the previous rows.

The example above uses the expected values, but assume they are incorrect.

How can I add a new column (say "count") which will count all previous rows which have the same values of the specified variables?

I have gotten help on a similar problem of just selecting all rows with the same values for specified columns, so I supposed I could just write a loop around that, but it seems inefficient to me.

Related posts about dataframe

Pandas Dataframe add rows on top of dataframe

as seen on Stack Overflow - Search for 'Stack Overflow'
I am trying to add blank rows on top of the pandas Dataframe data. Basically, some blank rows and some calculation for each row which contains calculations for Average etc. for that column. Can someone please help me how I can do this? From: A B D E F G H I J 0 -8 10… >>> More
Converting a dataframe to a vector (by rows)

as seen on Stack Overflow - Search for 'Stack Overflow'
I have a dataframe with numeric entries like this one test <- data.frame(x=c(26,21,20),y=c(34,29,28)) How can I get the following vector? > 26,34,21,29,20,28 >>> More
writing to a dataframe from a for-loop in R

as seen on Stack Overflow - Search for 'Stack Overflow'
I'm trying to write from a loop to a data frame in R, for example a loop like this for (i in 1:20) { print(c(i+i,i*i,i/1))} and to write each line of 3 values to a data frame with three columns, so that each iteration takes on a new row. I've tried using matrix, with ncol=3 and filled by rows,… >>> More
R get rid of rows with duplicate attribute

as seen on Stack Overflow - Search for 'Stack Overflow'
hi there I have a big dataframe with columns such as: ID, time, OS, IP Each row of that dataframe corresponds to one entry. Within that dataframe for some IDs serveral entries (rows) exist. I would like to get rid of those multiple rows (obviously the other attributes will differ for the same ID)… >>> More
Replace values in a dataframe based on another factor which contains NA's in R

as seen on Stack Overflow - Search for 'Stack Overflow'
I have a dataframe which contains (among other things) a numeric column with a concentration, and a factor column with a status flag. This status flag contains NA's. Here's an example df<-structure(list(conc = c(101.769, 1.734, 62.944, 92.697, 25.091, 27.377, 24.343, 55.084, 0.335, 23.280),… >>> More

Developer IT