subset complete or balance dataset in r

Posted by SHRram on Stack Overflow See other posts from Stack Overflow or by SHRram
Published on 2012-12-19T17:02:22Z Indexed on 2012/12/19 17:03 UTC
Read the original article Hit count: 169

Filed under:

r

|

dataset

I have a dataset that unequal number of repetition. I want to subset a data by removing those entries that are incomplete (i.e. replication less than maximum). Just small example:

set.seed(123)
mydt <- data.frame (name= rep ( c("A", "B", "C", "D", "E"), c(1,2,4,4, 3)), 
                   var1 = rnorm (14, 3,1), var2 = rnorm (14, 4,1))
 mydt
       name     var1     var2
1     A 2.439524 3.444159
2     B 2.769823 5.786913
3     B 4.558708 4.497850
4     C 3.070508 2.033383
5     C 3.129288 4.701356
6     C 4.715065 3.527209
7     C 3.460916 2.932176
8     D 1.734939 3.782025
9     D 2.313147 2.973996
10    D 2.554338 3.271109
11    D 4.224082 3.374961
12    E 3.359814 2.313307
13    E 3.400771 4.837787
14    E 3.110683 4.153373

summary(mydt)

name       var1            var2      
 A:1   Min.   :1.735   Min.   :2.033  
 B:2   1st Qu.:2.608   1st Qu.:3.048  
 C:4   Median :3.120   Median :3.486  
 D:4   Mean   :3.203   Mean   :3.688  
 E:3   3rd Qu.:3.446   3rd Qu.:4.412  
       Max.   :4.715   Max.   :5.787

I want to get rid of A, B, E from the data as they are incomplete. Thus expected output:

name     var1     var2
4     C 3.070508 2.033383
5     C 3.129288 4.701356
6     C 4.715065 3.527209
7     C 3.460916 2.932176
8     D 1.734939 3.782025
9     D 2.313147 2.973996
10    D 2.554338 3.271109
11    D 4.224082 3.374961

Please note the dataset is big, the following may not a option:

mydt[mydt$name == "C",]
mydt[mydt$name == "D", ]

Related posts about dataset

Dataset -> XML Document - Load DataSet into an XML Document - C#.Net

as seen on Stack Overflow - Search for 'Stack Overflow'
Hello I'm trying to read a dataset as xml and load it into an XML Document. XmlDocument contractHistoryXMLSchemaDoc = new XmlDocument(); using (MemoryStream ms = new MemoryStream()) { //XmlWriterSettings xmlWSettings = new XmlWriterSettings(); //xmlWSettings.ConformanceLevel = ConformanceLevel… >>> More
Combine multiple dataset columns to one dataset

as seen on Stack Overflow - Search for 'Stack Overflow'
I have multiple datasets that I would like to combine into one. There is a common ID field that can be associated to each row. Calling Merge on the dataset will add additional rows to the dataset, but I would like to combine the additional columns. There are too many fields to do this in one query… >>> More
Making a DataSet from another DataSet

as seen on Stack Overflow - Search for 'Stack Overflow'
Hi folks I have a client-server project (small project for companies in C#) and the server has a DataSet with some tables (there is no Database for some reasons so we save the DataSet as an XML file). when the clients connect to the server, the server should send some informations to the client depends… >>> More
Declaring variables with New DataSet vs DataSet

as seen on Stack Overflow - Search for 'Stack Overflow'
What is the impact of creating variables using: Dim ds as New DataSet ds = GetActualData() where GetActualData() also creates a New DataSet and returns it? Does the original empty DataSet that was 'New'ed just get left in the Heap? What if this kind of code was in many places? Would… >>> More
DataSet.Copy vs Dataset.Clone

as seen on Stack Overflow - Search for 'Stack Overflow'
Can someone explain me DataSet.Copy vs Dataset.Clone Also let me know some scenario's where we can use these >>> More

Developer IT

subset complete or balance dataset in r - Developer IT

subset complete or balance dataset in r

r

dataset

Related posts about r

Related posts about dataset

Dataset -> XML Document - Load DataSet into an XML Document - C#.Net

Combine multiple dataset columns to one dataset

Making a DataSet from another DataSet

Declaring variables with New DataSet vs DataSet

DataSet.Copy vs Dataset.Clone

Categories cloud