Data preparation is an important step in many applications. In this example, we aim to prepare our data for seq2seq model training.
In this project we will prepare a text-based dataset for further use.
This involves analyzing the dataset, cleaning it, and splitting it into designated columns.
The starting CSV file consists of a single column in which every row is a separate report. We will extract parts of those reports and split them into columns.
The CSV file is poorly structured, which makes it a useful case for understanding the methods.
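
As a rough illustration of splitting a one-column CSV of reports into columns, here is a minimal sketch that uses only the Python standard library. The section labels `SUMMARY` and `DETAILS`, the helper names `split_report` and `split_csv`, and the sample rows are all hypothetical; the real reports may use different markers and need different cleaning rules.

```python
import csv
import io
import re

# Hypothetical section labels; the real reports may use different markers.
SECTIONS = ["SUMMARY", "DETAILS"]


def split_report(report, sections=SECTIONS):
    """Split one free-text report into a dict with one key per section label."""
    # Collapse runs of whitespace so messy spacing does not leak into columns.
    text = re.sub(r"\s+", " ", report).strip()
    # Match any known label followed by a colon, ignoring case.
    pattern = re.compile(
        r"\b(" + "|".join(map(re.escape, sections)) + r")\s*:", re.IGNORECASE
    )
    # Map the lower-cased label back to its canonical column name.
    canonical = {name.lower(): name for name in sections}
    row = {name: "" for name in sections}
    matches = list(pattern.finditer(text))
    for i, match in enumerate(matches):
        # A section runs until the next label, or to the end of the report.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        row[canonical[match.group(1).lower()]] = text[match.end():end].strip()
    return row


def split_csv(source, sections=SECTIONS):
    """Read a one-column CSV (one report per row) and return one dict per report."""
    reader = csv.reader(source)
    return [split_report(r[0], sections) for r in reader if r and r[0].strip()]


# Made-up sample standing in for the real file: inconsistent case and spacing,
# a line break inside a quoted cell, and a report with a missing section.
sample = io.StringIO(
    '"Summary:  device restarted.\n Details: no errors in log"\n'
    '"details: follow-up planned"\n'
)
rows = split_csv(sample)
for row in rows:
    print(row)
```

Sections that are missing from a report are left as empty strings, so every row ends up with the same columns. To run this against a file instead of the in-memory sample, pass an open file handle, for example `open(path, newline="", encoding="utf-8")`.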
Separate repos will be shared for data augmentation and AI training.
Text Data Augmentation repo -> https://github.com/onuralpArsln/AugmentTextData