Types
Dataset size reduction can be performed in one of the two ways [5]
- feature set reduction (feature selection)
- sample set reduction.
Traditional dimensionality reduction approaches:
the dimensionality of data increases, the computational cost of traditional dimensionality reduction methods grows exponentially, and the computation becomes prohibitively intractable.
These drawbacks have triggered the development
- random projection reduced time cost
However, the RP transformation matrix is generated without considering the intrinsic structure of the original data and usually leads to relatively high distortion.
Reference List
- https://zhuanlan.zhihu.com/p/159285110?utm_id=0
- https://zhuanlan.zhihu.com/p/62470700?utm_id=0
- Xie, H., Li, J., & Xue, H. (2017). A survey of dimensionality reduction techniques based on random projection. arXiv preprint arXiv:1706.04371.
- Boutsidis, C., Zouzias, A., & Drineas, P. (2010). Random projections for -means clustering. Advances in neural information processing systems, 23.
- Jović, A., Brkić, K., & Bogunović, N. (2015, May). A review of feature selection methods with applications. In 2015 38th international convention on information and communication technology, electronics and microelectronics (MIPRO) (pp. 1200-1205). Ieee.