Preprocessed full resolution images. Used in various stages of pipeline. Are not modified or altered. Output product of Preprocessing stage which includes raw image conversion, color card calibration, and other color corrections.
<example of fullsized images here> Check google drive sharefolder for details or pull from preprocessing pipeline (reach out to me if you need help with that) or Matt can get some examples.
2. Masks
2.1 Full Sized Semantic Masks
Semantic labels output from segment-vegetation. Color labels are located in Image and Cutout metadata jsons.
Semantic masks - pixel-wise labels by species
Pixel values are the class id
Rgb values can be used to consistently remap class id pixel values for visualization purposes
<examples of the fullsized masks here> examples of masks can be found here: Data Examples
3. Metadata
various metadata including camera, species, and localization information. Output of remap stage, main input segmentation-vegetation stage that provides bounding box and species information. Cutout ID are added during segment-vegetation
This schema defines the structure for metadata related to images captured in a semifield environment. Each image is accompanied by detailed metadata for exif information, camera settings, annotations, and categorical classifications.
This README provides a detailed explanation of the metadata properties used in the Semifield-developed-image and Semifield-cutouts schemas. Each schema is designed to capture essential metadata for image processing, categorization, and analysis in agricultural and machine learning applications.
3.11 General Properties
Property
Type
Description
Property
Type
Description
season
string
The targeted plant type and year during which images were captured (e.g., weeds_2022).More Info
datetime
string
The date and time when the image was captured format.
bbot_version
string
The version of the batch bot system used for processing the image. More Info
batch_id
string
A unique identifier made up of a state abbreviation and date for the batch that contains the image (e.g., MD_2024-01-12).
image_id
string
A unique identifier made up of a state abbreviation and unix epoch timestamp for each image.
validated
boolean
Indicates whether the quality of species labels, bboxes, and masks has been validated (true or false).
version
string
data version number. Include metadata and masks
3.12 Exif Meta Properties
Property
Type
Description
Property
Type
Description
ImageWidth
integer
The width of the image in pixels.
ImageLength
integer
The height of the image in pixels.
Make
string
The manufacturer of the camera used to capture the image.
Model
string
The model of the camera used to capture the image.
Software
string
The software used to process or edit the image.
DateTime
string (date-time)
The date and time when the image was captured.
ExposureTime
string
The exposure time used for capturing the image.
FNumber
string
The F-number (aperture) used during image capture.
ExposureProgram
integer
The camera exposure program mode (e.g., manual, auto).
ISOSpeedRatings
integer
The ISO speed rating set on the camera.
RecommendedExposureIndex
integer
The recommended exposure index for the image.
ExifVersion
array of integers
The version of the EXIF metadata.
BrightnessValue
string
The brightness of the image.
MaxApertureValue
string
The maximum aperture value of the camera lens.
LightSource
integer
The type of light source (e.g., daylight, fluorescent).
Flash
integer
Indicates if the flash was used when capturing the image.
FocalLength
string
The focal length of the camera lens in millimeters.
ExposureMode
integer
The exposure mode set on the camera (e.g., auto or manual).
WhiteBalance
integer
The white balance setting of the camera.
FocalLengthIn35mmFilm
integer
The focal length equivalent to 35mm film.
Contrast
integer
The contrast setting applied to the image.
Saturation
integer
The saturation setting applied to the image.
Sharpness
integer
The sharpness setting applied to the image.
LensModel
string
The model of the camera lens used for capture.
LensSpecification
array of strings
Specifications for the camera lens (e.g., focal length range).
BodySerialNumber
string or null
The serial number of the camera body.
3.13 Camera Info Properties
Property
Type
Description
Property
Type
Description
aligned
boolean
Indicates whether the image was aligned during reconstruction More Info.
estimated_xyz
array of numbers
Estimated XYZ position of the camera in 3D space.
estimated_pitch
number
The estimated pitch angle of the camera in degrees.
estimated_yaw
number
The estimated yaw angle of the camera in degrees.
estimated_roll
number
The estimated roll angle of the camera in degrees.
pixel_width
number
The estimated width of a single pixel in the image.
pixel_height
number
The estimated height of a single pixel in the image.
focal_length
number
The focal length of the camera in pixels.
camera_coefficients
object
Coefficients for the camera calibration model.
fov
object
Field of view properties for the camera.
3.14 Field of View
Property
Type
Description
Property
Type
Description
height
number
width
number
top_left_xy
array of numbers
top_right_xy
array of numbers
bottom_left_xy
array of numbers
bottom_right_xy
array of numbers
fov_area_cm2
number
3.15 Annotations Properties
Property
Type
Description
Property
Type
Description
is_primary
boolean
Indicates if the annotation is the primary annotation. More info
cutout_exists
boolean
Indicates if a cutout corresponding to the annotation exists.
bbox_xywh
array of 4 integers
The bounding box for the annotation in [x, y, width, height] format. xy is of the top left corner.
category_class_id
integer
The class identifier for the category of the object in the annotation.
cutout_id
string
A unique identifier derived from the image_id and appended with an integer denoting the cutout number.
overlapping_cutout_ids
array of strings
A list of cutouts that overlap with this one.
non_target_weed
boolean
non target weed classifier prediction results
non_target_weed_pred_conf
number
prediction confidence of model indicating whether it’s a non-target weed. From custom trained classifier
3.16 Categories Properties
Property
Type
Description
Property
Type
Description
class_id
integer
The identifier for the classification category.
USDA_symbol
string
The USDA symbol for the category.
EPPO
string
The EPPO code for the category.
group
string
The group (monocot or dicot) to which the category belongs.
class
string
The class to which the category belongs.
subclass
string
The subclass to which the category belongs.
order
string
The order of the category.
family
string
The family of the category.
genus
string
The genus of the category.
species
string
The species of the category.
common_name
string
The common name for the category.
authority
string
The authority that classified the category.
growth_habit
string
The growth habit of the plant (e.g., vine, shrub).
duration
string
The duration of the plant's life cycle (e.g., annual, perennial).
category
string
The general category to which the classification belongs.
multi_species_USDA_symbol
string or null
List of species in a class with multiple species. Rarely used.
link
string or null
A link to more information about the category.
note
string or null
Any notes related to the category.
rgb
array of 3 integers
hex color id.
hex
string
3.2 Detailed Descriptions
3.21 Season Details
Description: The season field indicates the specific growing season plant type that was targeted and the year during which images were captured.
Valid Values:
weeds_2022
weeds_2024
weeds_2025
cash_crops_2023
cover_crops_2022_2023
cover_crops_2023_2024
cover_crops_2024_2025
Usage Notes:
These are only general labels and are not strict. The majority of the plants are describved by the label, but in some cases other plant types may be mixed in. For example, Purple nutsedge could be in cover_crops_2023_2024 to take advantage of the longer growing season, despite it not being a cover crop.
This field is used to group images based on seasonal context, which is essential for comparative analysis across different time periods.