Report overview

This is the overview for the report. The actual report is in the main.tex file in the same folder and constructed from the other .tex files in the Chapters folder. To see the most recent changes in the report, please create a pdf file from the main.tex file or have a look at asparagus-report.pdf.

Task distribution is marked like this.

1 Introduction

The project Josefine

Rough summary of the idea of the project.    How did the idea come up?    What is the project about?

Background on computer vision based classification tasks Sophia

Short introduction into the field of computer vision and short introduction to artificial neural networks.    Prospects and challenges of both on a general basis.    This part is kept very brief.

Background on sorting asparagus Josefine

What is the main focus during the sorting process, i.e. why do you need to sort asparagus and in which    classes do you sort it?    What problems and challenges will be met - including the difference of challenge for humans vs. machines.    Including the decision tree for the labels as sorted with Silvan’s machine.

2 Data acquisition and organization

Timetable (roadmap) of the project Josefine

A (visual) overview of the project’s course during the year.    How we planned the project vs. how the actual timetable of the project looked like.

Organisation of the study group Richard

    Communication Richard

Deciding on a communication platform and handling it: organization in general, team meetings,    Asana, Telegram, and any other virtual form of communication

    Teamwork Richard

The experience of working in a team and organising/distributing tasks in a group:    task distribution, cooperation on tasks, GitHub: organising our project with Git,    Grid: working together with the IKW data store, other teamwork related experiences    (could we integrate the strengths and weaknesses of the single team members) …

Data collection Josefine, Richard

How we collected the data and what the data looked like.    Here, the process of driving to Rheine and collecting the data with an external hard-drive is described.    Understanding the sorting machine and its output.    First problems that had to be resolved: labelled vs. unlabelled data ( -> running pre-sorted pieces    through the sorting machine, did not (completely) resolve our problem), saving the data manually    on an external harddrive ( -> solved by building a script for data transfer, and Teamviewer sessions)

Literature research Josefine

Previous literature research concerning food classification and handling unlabelled data.    Searching for background literature close to our project, e.g. automatic CV-based sorting of other    food products.    Re-reading on potential ANN structures that we could use for sorting.    Could we rely on a certain paper/process? Did it work?

3 Preprocessing and data set creation

Preprocessing steps Sophia

First approach to create a dataset (layout) and data augmentation to generate more samples.    Preparation for manual feature extraction, background removal, etc. …    Preparing the images for manual classification to create more labelled data: sorting the pictures in    the grid, have 3 pictures per asparagus spear, etc. …

Automatic feature extraction Sophia, Michael

We (tried to) create scripts for an automatic feature extraction pipeline    (including the decision to sort for features and not labels) for    rust, bent, etc. … (all features)

The hand-label app Michael

Introduction to the script created for manual sorting. Fusion of the feature extraction scripts:    What is it? Why did we need it? What was the idea behind it? How does it work? What is the output of the app?

Manual labeling Josefine

    Sorting criteria Josefine

The criteria explained in detail for the hand-labeling of the features with the app (including example    pictures). What are expected difficulties we might encounter?

    Sorting outcome Josefine

The process and the results of the sorting: How much did we sort? How well did the sorting work in general:    i.e., was it easy to sort? How long did it take? What problems were encountered?

    Agreement measures Malin

Theoretical background on a measurement that assesses our sorting agreement.

    Reliability Malin

Expanding on how accurately we sorted/how valid our sorting was as a group. Introducing the Kappa Agreement.

The asparagus data set Richard, Sophia

Structural information on the datasets: What do they look like? How big are they (labelled vs unlabelled    samples)? Which were criteria for throwing out data?    Problems and challenges during the creation of the datasets: What were the challenges in creating a    general dataset? What were challenges in general? How well could we work with the datasets? What was    used as training data, validation data, and test data?

4 Classification

Supervised learning Josefine

    Feature Engineering Michael    Single-label classification Josefine    Multi-label classification Sophia    Head-related Feature Network Michael    From Features to Labels Katharina

Unsupervised learning Malin

    Principal component analysis Malin, Maren

Semi-supervised learning Michael

    Autoencoder Michael

5 Summary Maren

5 Discussion Malin

Classification results Malin, Michael

Methodology Malin

Organization Josefine, Richard

6 Conclusion Richard

Outlook of the project. Contribution to scientific landscape?