The operation of synchrotron light sources with short electron bunches increases the emitted CSR power in the THz frequency range. However, the spatial compression leads to complex longitudinal dynamics, causing the formation of micro-structures in the longitudinal bunch profiles. The fast temporal variation and small scale of these micro-structures put challenging demands on their observation. At the KIT storage ring KARA (KArlsruhe Research Accelerator), diagnostics have been developed allowing direct observation of the dynamics by an electro-optical setup, and indirect observation by measuring the fluctuation of the emitted CSR. In this contribution, we present studies of the micro-structure dynamics on simulated data, obtained using the numerical Vlasov-Fokker-Planck solver Inovesa, and first applications on measured data. To deal with generated data sets in the order of terabytes in size, we apply the machine learning technique k-means to identify the dominant micro-structures in the longitudinal bunch profiles. Following this approach, new insights on the correlation of the CSR power fluctuation to the underlying longitudinal dynamics can be gained