Content Comparison

...

Code Block
jupyter lab /path/to/folder

...

Basic conversions

R

Python

Variable assignment

name <- "John Doe"

name = "John Doe"

R does accept the = syntax for assignment, but it’s non-idiomatic and can lead to unexpected errors.

For example,

Code Block

language	r

my_function <- function(y = 3, x = 5) {
  print(paste(y, x))
}

my_function(x = 10)   # Passes 10 to the x parameter, and outputs "3 10"

my_function(x <- 20)  # Assigns 20 to the global x variable, which it passes to the y parameter. Outputs "20 5"

To avoid confusion, always use <- for assignment, and use = for passing arguments within function calls.

Declaring a Range

Code Block

language	r

1:10
seq(1, 20, 2)
print(1:10)

Code Block

language	py

range(1, 11)
range(1, 21, 2)
print(list(range(1, 11)))

Ouch. Python’s code is not only longer, but it excludes the final number.
range(1, 11) creates a range object in Python, which represents the numbers 1 through 10.

list(range(1, 11)) creates a list object in Python, which represents [1, 2, 3, … 10]
Both are similar to the 1:10 vector in R. But you need to change a range to a list in order to output it.

...

Example conversions

The following examples demonstrate how to convert the R code at https://github.com/jjennewein/MDA_2023-24 into equivalent Python code.

0.field buffer.R

0.field buffer.py

Code Block

language	r

library(sf)

Code Block
import geopandas as gpd

Loads a library for handling geospatial data, including working with shapefiles.

In the Python code, gpd is an alias for the geopandas library, which we use to reduce typing and improve code readability.

Code Block

language	r

setwd(dirname(rstudioapi::getActiveDocumentContext()$path))

RStudio defaults to a global working directory, which means file paths must be specified explicitly.

Python defaults to the folder where the source code is located.

Jupyter Notebook defaults to where you launch the notebook server.

You could launch it from the command line within the source folder:
Code Block
cd \psa\MDA_2023-24\python jupyter notebook
Or you could specify the folder directly when launching Jupyter:
Code Block
jupyter notebook \psa\MDA_2023-24\python

The original file path was "E:/Eastern_Shore/MD_enrollment_data_2023_24/fields/", but I don't have access to that location.

To avoid referencing "E:/Eastern_Shore..." explicitly, I saved all the necessary project files to the same folder as the source code, and I added a setwd() statement to the R source. I then removed all references to "E:/Eastern_Shore..."

This way, the R program can run without requiring the full path in filenames.

Code Block

language	r

mda = st_read("MDAEnrollment2023-2024.shp") %>%
  st_transform(., crs = 32618) %>%
  st_buffer(., dist = -30) %>%
  dplyr::filter(!st_is_empty(.))

Code Block

language	py

mda = gpd.read_file("MDAEnrollment2023-2024.shp") \
      .to_crs(epsg=32618)
mda["geometry"] = mda.geometry.buffer(-30)
mda = mda[~mda.geometry.is_empty]

This code:

Reads a shapefile into a spatial data frame (R) or a GeoDataFrame (Python).
Converts the coordinate reference system (CRS) to EPSG 32618 (WGS 84).
Applies a negative buffer of 30 meters to shrink the geometries.
Note that sf objects in R have a designated geometry column, but in GeoPandas you need to explicitly reference the geometry column.
Filters out any empty geometries resulting from the buffering.
In Python, ~ is a bitwise NOT operator.

R uses %>% for chaining operations, while Python uses . (dot notation).

In Python, a backslash (\) means line continuation. I used it here to align the code with the R version. But I could have also written it as:
mda = gpd.read_file("MDAEnrollment2023-2024.shp").to_crs(epsg=32618)

Also, note that the dots in st_transform(., crs = 32618) and st_buffer(., dist = -30) are optional but may improve readability.

Code Block
st_write(mda, "MDAEnrollment2023-2024_Buff30m.shp", append=FALSE)

Code Block
mda.to_file("MDAEnrollment2023-2024_Buff30m.shp")

Writes the buffered data frame to a new shapefile.

I added append=FALSE to the original source so the file would be overwritten if it exists. GeoPandas automatically overwrites existing files.

1.Extract_HLS_L30 .R

1.Extract_HLS_L30 .py

Code Block

language	r

library(dplyr)
library(terra)
library(sf)
library(exactextractr)

Code Block
import pandas as pd import rasterio import geopandas as gpd from rasterstats import zonal_stats

Syntax is easy. Knowing which libraries to use – not so much!

Code Block

language	r

setwd(dirname(rstudioapi::getActiveDocumentContext()$path))

Explained previously.

Code Block

language	r

fields = st_read("MDAEnrollment2023-2024_Buff30m.shp")

Code Block

language	py

fields = gpd.read_file("MDAEnrollment2023-2024_Buff30m.shp")

Reads the buffered shapefile.

Code Block
plot(fields[1])

Code Block
fields.plot(edgecolor='black')

Plots the shapefile.

edgecolor isn’t required in the Python code. It simply makes the output consistent with the R output.

Version	Old Version 6	New Version 7
Changes made by	rickhitchcock	rickhitchcock
Saved on	Oct 25, 2024	Nov 08, 2024

Versions Compared

Key

Basic conversions

Example conversions