Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »

“The scarcity of public image datasets remains a key bottleneck in developing next-generation computer vision and intelligent systems for precision agriculture.” (Lu and Young 2020)

Why?

Developing image datasets is difficult, especially for agricultural applications. Collecting and annotating images pose major challenges.

Image Collection

Data hungry deep learning models require hundreds of thousands, sometimes millions, of images to train and become invariant to changes in location, lighting, backgrounds, or other factors. Devising high-throughput systems is necessary to capture hundreds of thousands of images of weeds at various growth stages while affected by field-like conditions. A network of agronomic, hardware, and software engineers is needed to automate large scale image collection.

Labeling is Time Consuming

Pixel-wise labels must be accurate. The process of labeling by hand can take minutes to hours even when using third-part annotation tools. Many have reported the high time requirement needed for labeling images of weeds.

Source

Annotation Technique

Scene type

Time

Cicco et al. (2017)

manual segmentation of cutouts

Simple

5-30 min / real image

Skovsen et al. (2019) (in conversation)

manual segmentation of cutouts

complex (field)

hour(s) / real images

Sa et al. (2017)

manual segmentation of drone images of 3 classes

complex (field)

60 min / image

Bosilj et al. (2020)

manual segmentation 3 classes

complex (field)

2-4 hrs / image

Labeling is Expensive

Texas A&M Case Study

Used a third-party labeling service to sematically label images of agricultural weeds and plants at early growth stages roughly 2-6 weeks after emergence. Details are as follows:

  • Company: Precise BPO Solution

  • 1000 images

  • 11726 segments

  • $0.125 per segment*

    • *While TAMU was given a discount of $0.095 per segment, $0.125, the non-discounted price, is used here. It is unlikely the company will provide the same discount for images over 1000.

  • Time to label all images - 2.5 weeks

  • $1465.75 total cost

Agricultural Scenes are Diverse

  • High individual diversity from differences in growth stages and effects of biotic and abiotic factors on morphology

  • Large intra-species variation and similarity with other species

  • Complex agricultural scenes as the result of effect from climate, soil, ground residue, and other weed populations.

Labeling Cost Projections for SemiField

We can estimate the expected costs and time for SemiField data collection using the Texas A&M numbers.

  • here we use a smaller average of 8 segments per image (instead of 11.726 like TAMU)

  • $0.125 per segment

  • 2.5 weeks (13 working days from 8am - 5pm) = ~104 worker hours

  • 104 hours translates to 32 seconds per segment (104 / 11726)

Images

Worker Hours estimate

Total Cost ($)

1,000

6249.958

$1,465.750

1,000

4264

$1,000.000

10,000

42640

$10,000.000

25,000

106600

$25,000.000

100,000

426400

$100,000.000

250,000

1066000

$250,000.000

We don't have a way to export this macro.
  • No labels