The internal implementation of R's dataset
- by Yin Zhu
I am trying to build a data processing program. Currently I use a matrix of doubles to represent the data table: each row is an instance and each column is a feature. I also keep a separate vector holding the target value for each instance; it is of double type for regression and of integer type for classification.
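A minimal sketch of the layout described above, written in Python purely for illustration. The post does not say which language the program uses, and `Dataset`, `features`, `target`, and `column` are hypothetical names, not part of any existing code.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class Dataset:
    """Hypothetical container: a row-major feature matrix plus a target vector."""

    features: List[List[float]]             # one inner list per instance
    target: Union[List[float], List[int]]   # float for regression, int for classification

    def __post_init__(self) -> None:
        # Every instance needs exactly one target value.
        if len(self.features) != len(self.target):
            raise ValueError("features and target must have the same number of rows")
        # All rows must have the same number of features.
        if len({len(row) for row in self.features}) > 1:
            raise ValueError("all rows must have the same number of columns")

    @property
    def n_instances(self) -> int:
        return len(self.features)

    @property
    def n_features(self) -> int:
        return len(self.features[0]) if self.features else 0

    def column(self, j: int) -> List[float]:
        # With row-major storage, reading one feature means visiting every row.
        return [row[j] for row in self.features]


# Regression: the target values are doubles.
regression = Dataset(
    features=[[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],
    target=[0.5, 1.5, 2.5],
)

# Classification: the target values are integer class labels.
classification = Dataset(
    features=[[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],
    target=[0, 1, 0],
)
```

Because the matrix holds only doubles, every feature has to be numeric, and the target type is the only thing that differs between the two cases. That is the restriction a more general structure would need to lift.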
I want to make it more general. What kind of structure does R use to store a dataset, i.e. how is a dataset implemented internally in R?