Report overview¶
This is the overview for the report. The actual report is in the main.tex file in the same folder and constructed from the other .tex files in the Chapters folder. To see the most recent changes in the report, please create a pdf file from the main.tex file or have a look at asparagus-report.pdf.
Task distribution is marked like this.
1 Introduction¶
The project Josefine
Rough summary of the idea of the project. How did the idea come up? What is the project about?
Background on computer vision based classification tasks Sophia
Short introduction into the field of computer vision and short introduction to artificial neural networks. Prospects and challenges of both on a general basis. This part is kept very brief.
Background on sorting asparagus Josefine
What is the main focus during the sorting process, i.e. why do you need to sort asparagus and in which classes do you sort it? What problems and challenges will be met - including the difference of challenge for humans vs. machines. Including the decision tree for the labels as sorted with Silvan’s machine.
2 Data acquisition and organization¶
Timetable (roadmap) of the project Josefine
A (visual) overview of the project’s course during the year. How we planned the project vs. how the actual timetable of the project looked like.
Organisation of the study group Richard
Communication Richard
Deciding on a communication platform and handling it: organization in general, team meetings, Asana, Telegram, and any other virtual form of communication
Teamwork Richard
The experience of working in a team and organising/distributing tasks in a group: task distribution, cooperation on tasks, GitHub: organising our project with Git, Grid: working together with the IKW data store, other teamwork related experiences (could we integrate the strengths and weaknesses of the single team members) …
Data collection Josefine, Richard
How we collected the data and what the data looked like. Here, the process of driving to Rheine and collecting the data with an external hard-drive is described. Understanding the sorting machine and its output. First problems that had to be resolved: labelled vs. unlabelled data ( -> running pre-sorted pieces through the sorting machine, did not (completely) resolve our problem), saving the data manually on an external harddrive ( -> solved by building a script for data transfer, and Teamviewer sessions)
Literature research Josefine
Previous literature research concerning food classification and handling unlabelled data. Searching for background literature close to our project, e.g. automatic CV-based sorting of other food products. Re-reading on potential ANN structures that we could use for sorting. Could we rely on a certain paper/process? Did it work?
3 Preprocessing and data set creation¶
Preprocessing steps Sophia
First approach to create a dataset (layout) and data augmentation to generate more samples. Preparation for manual feature extraction, background removal, etc. … Preparing the images for manual classification to create more labelled data: sorting the pictures in the grid, have 3 pictures per asparagus spear, etc. …
Automatic feature extraction Sophia, Michael
We (tried to) create scripts for an automatic feature extraction pipeline (including the decision to sort for features and not labels) for rust, bent, etc. … (all features)
The hand-label app Michael
Introduction to the script created for manual sorting. Fusion of the feature extraction scripts: What is it? Why did we need it? What was the idea behind it? How does it work? What is the output of the app?
Manual labeling Josefine
Sorting criteria Josefine
The criteria explained in detail for the hand-labeling of the features with the app (including example pictures). What are expected difficulties we might encounter?
Sorting outcome Josefine
The process and the results of the sorting: How much did we sort? How well did the sorting work in general: i.e., was it easy to sort? How long did it take? What problems were encountered?
Agreement measures Malin
Theoretical background on a measurement that assesses our sorting agreement.
Reliability Malin
Expanding on how accurately we sorted/how valid our sorting was as a group. Introducing the Kappa Agreement.
The asparagus data set Richard, Sophia
Structural information on the datasets: What do they look like? How big are they (labelled vs unlabelled samples)? Which were criteria for throwing out data? Problems and challenges during the creation of the datasets: What were the challenges in creating a general dataset? What were challenges in general? How well could we work with the datasets? What was used as training data, validation data, and test data?
4 Classification¶
Supervised learning Josefine
Feature Engineering Michael Single-label classification Josefine Multi-label classification Sophia Head-related Feature Network Michael From Features to Labels Katharina
Unsupervised learning Malin
Principal component analysis Malin, Maren
Semi-supervised learning Michael
Autoencoder Michael
5 Summary Maren¶
5 Discussion Malin¶
Classification results Malin, Michael
Methodology Malin
Organization Josefine, Richard
6 Conclusion Richard¶
Outlook of the project. Contribution to scientific landscape?