This document describes the structure of the weedsimagerepo Azure storage account and how the pre-processing metadata tables relate to each other.

...

  • wirweedsmeta: second-level table. Contains only rows where PlantType = WEEDS.

    • PartitionKey (string, autogenerated): Azure storage name.

    • RowKey (string, autogenerated): unique id for each table entry. I don’t see the use for this column: the uid is not used to relate this table to others, nor is it present in the blob.

    • CropOrFallow (string, user input, multiple choice): whether the images of the target weeds are collected in a field where a crop was planted or in a fallow field.

    • MasterRefID (string, autogenerated): unique id for each table entry. Used to relate this table to others.

    • SizeClass (string, user input, dropdown): determined by the size of the target plant. This column was added in the second year of image collection; previously we used Height. There are two sets of values in the data, 1, 2 and 3 and Small, Medium and Large, because the levels were changed at some point. The current levels are the latter.

    • FlowerFruitOrSeeds (Boolean, user input, multiple choice): whether reproductive organs are present on the target plant.

    • WeedType (string, user input, dropdown): target weed species common name. These categories have changed since the app was first released, so there may be categories that should be the same but are not, e.g. a binomial name with both words capitalized and the same name with only the first word capitalized.

    • Height (string, user input, dropdown): ranges of heights, determined by the size of the target plant. This field was only used in the first year of image collection and was later replaced by SizeClass.

    • CropType (string, user input, dropdown): crop that the target weeds are growing in. Only available if the answer to CropOrFallow is Crop.

    • Timestamp (date, autogenerated): date and time of upload to this storage.

There is no reason for this table to exist, since all the distinct variables stored here belong in the higher-level table.
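The mixed SizeClass and WeedType values described above could be harmonized before analysis. The following is only a minimal sketch: the mapping of 1, 2 and 3 to Small, Medium and Large in that order is an assumption that this page does not confirm, and the choice of "first word capitalized, remaining words lower case" as the canonical WeedType form is likewise hypothetical. The function names are illustrative, not part of any existing code.

```python
# ASSUMPTION: 1/2/3 correspond to Small/Medium/Large in this order.
# The page only says both sets of levels exist; verify before relying on this.
SIZE_CLASS_MAP = {"1": "Small", "2": "Medium", "3": "Large"}


def normalize_size_class(value):
    """Map legacy numeric levels to the current levels; leave other values as-is."""
    text = str(value).strip()
    return SIZE_CLASS_MAP.get(text, text)


def normalize_weed_type(name):
    """Collapse capitalization variants of the same name into one form.

    ASSUMPTION: the canonical form is 'First word capitalized, rest lower case'.
    """
    words = name.split()
    if not words:
        return ""
    return " ".join([words[0].capitalize()] + [w.lower() for w in words[1:]])
```

For example, under these assumptions normalize_weed_type("Amaranthus Palmeri") and normalize_weed_type("Amaranthus palmeri") return the same string, so both variants fall into one category (the species name here is only an illustration).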

  • wirimagerefs: third and lowest-level table. This table contains metadata specific to each image file. The ImageURL field contains the image file name, which is used to pair the metadata with the image files.

    • PartitionKey (string, autogenerated): Azure storage name.

    • RowKey (string, autogenerated): unique id for each table entry. I don’t see the use for this column: the uid is not used to relate this table to others, nor is it present in the blob.

...

    • MasterRefID (string, autogenerated): unique id for each table entry. Used to relate this table to others.

    • ImageURL (string, autogenerated): URL formed by the blob URL + / + image file name (e.g. https://weedsimagerepo.blob.core.windows.net/weedsimagerepo/TXF03026.ARW, TXF03026.ARW being the image file name). This field is what allows the table metadata to be paired with the images stored in the blob.

    • ImageIndex (integer, autogenerated): order in which the images belonging to the same package were collected. Each package contains 10 images (0-9).

    • Timestamp (date, autogenerated): date and time of upload to this storage.
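Since MasterRefID is described as the field used to relate the tables, the relationship between wirweedsmeta and wirimagerefs can be sketched as a simple join. This is only an illustration: it assumes the rows have already been read from the tables into plain dictionaries keyed by the column names above, and that the images of one package share the MasterRefID of their wirweedsmeta row. The function names and the "Images" key are hypothetical.

```python
from collections import defaultdict


def group_images_by_master_ref(image_rows):
    """Group wirimagerefs rows by MasterRefID, ordered by ImageIndex."""
    grouped = defaultdict(list)
    for row in image_rows:
        grouped[row["MasterRefID"]].append(row)
    for rows in grouped.values():
        rows.sort(key=lambda r: r["ImageIndex"])
    return dict(grouped)


def attach_images(weeds_rows, image_rows):
    """Return a copy of each wirweedsmeta row with its image rows attached.

    ASSUMPTION: both tables carry the same MasterRefID for the same package.
    """
    grouped = group_images_by_master_ref(image_rows)
    return [
        {**weed, "Images": grouped.get(weed["MasterRefID"], [])}
        for weed in weeds_rows
    ]
```

A row whose "Images" list is empty, or has fewer than the expected 10 entries, would point to missing or incomplete image metadata for that package.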

Blob

  • weedsimagerepo: contains all the image files. Each file can be paired with the metadata saved in the tables by matching the image “Name” on this blob against the image file name at the end of the ImageURL field.
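The pairing between blob names and ImageURL values can be sketched as follows. This is a minimal example, assuming blob names are flat file names (as in the TXF03026.ARW example) and that the list of blob names has already been obtained by some other means; the function names are hypothetical.

```python
import posixpath
from urllib.parse import unquote, urlparse


def blob_name_from_image_url(image_url):
    """Extract the image file name (the last path segment) from an ImageURL."""
    path = unquote(urlparse(image_url).path)
    return posixpath.basename(path)


def pair_metadata_with_blobs(image_rows, blob_names):
    """Split wirimagerefs rows into those with and without a matching blob.

    ASSUMPTION: a blob's "Name" equals the file name at the end of ImageURL.
    """
    available = set(blob_names)
    paired, missing = [], []
    for row in image_rows:
        name = blob_name_from_image_url(row["ImageURL"])
        (paired if name in available else missing).append((name, row))
    return paired, missing
```

Rows returned in the second list have metadata but no corresponding image file in the blob, which is worth checking before any downstream processing.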