Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

This document describes the weedsimagerepo azure storage account structure and how the pre processing metadata tables relate to each other.

...

  • wirmastermeta: first level table. Bellow are the most relevant fields that it contains:

    • PatitionKey (string, autogenerated): azure storage name

    • RowKey (string, autogenerated): unique id for each table entry. Used to relate this table to others.

    • UsState (string, user input, dropdown): Unique partner code. There are inconsistencies in the partner code since we have more than 1 partner for some of the states which results in having some 2 characters codes and other 4 characters codes. I would like to modify this so all codes are 4 characters, the first 2 letters for the state followed by 2 numbers (01, 02, etc). The back end parter codes are called affiliations in the front end and are formed by the US state initials + the primary investigator’s last name for that group (e.g. MD-Mirsky).

    • PlantType (string, user input, dropdown): Three plant categories, all upper case, no spaces.

    • CloudCover (string, user input, dropdown):

    • GroundResidue (string, user input, dropdown and type): type of ground residue, e.g. previous crop in the rotation.

    • GroundCover (string, user input, dropdown): 5 ranges from 0 to 100% coverage.

    • Timestamp (date, autogenerated): is the date and time of upload to this storage.

    • Username (string, user input, type): This one is a free for all, we didn’t ask the users to enter anything specific. In some cases they did enter a name in others just a letter or initials. There are also empty cells sue to an early version of the app which didn’t require the users to complete this field. There can be multiple user names per partner code.

    • WeedsOrCrops: This column has a few entries and it’s just redundant information which repeats the contents of the PlantType field.

  • wircovercropsmeta: second level table. Contains PlantType = COVERCROP only data. Things to note about the data in this table: PartitionKey and Affiliation both contain the same information and these information already exists in the higher level table as UsState. CloudCover, GroundResidue and GroundCover are also repeated from the higher level table.

    • FlowerFruitOrSeeds (Boolean, user input, multiple choice): are there or are there not reproductive organs.

    • CoverCropSpecies (string, user input, dropdown): species of cover crops specifically selected for this repository.

    • CoverCropFamily (string, user input, dropdown): category of cover crop.

...

  • wirimagerefs: third and lowest level table. This table contains metadata specific to each image file. The ImageURL field contains the image file name which is used to pair the data to the data.

    • PartitionKey (string, autogenerated): azure storage name

    • RowKey (string, autogenerated): unique id for each table entry. I don’t see the use for this column, uid is not used to relate this table to others nor is it present in the blob.

    • MasterRefID (string, autogenerated): unique id for each table entry. Used to relate this table to others.

    • ImageURL (string, autogenerated): url formed by the blob url + /image file name (e.g https://weedsimagerepo.blob.core.windows.net/weedsimagerepo/TXF03026.ARW, TXF03026.ARW being the image file name). This field is what allows to pair the tables metadata with the images stored in the blob.

    • ImageIndex (integer, autogenerated): order in which the images which are part of the same package were collected. Each package contains 10 images (0-9).

    • Timestamp (date, autogenerated): is the date and time of upload to this storage.

...