Exploring Echoregions Regions2D Functionality

Exploring Echoregions Regions2D Functionality#

This notebook parses region values from an Echoview .evr file and creates a region mask for the corresponding Echogram data.

Installation#

Prior to running this notebook and all other notebooks, make sure to pip install Echoregions and Echopype Plotting Library.

Install Using PyPi:

pip install echoregions

pip install echopype[plot]

Install Using Latest Github Main Branch Commit:

pip install git+https://github.com/OSOceanAcoustics/echoregions.git

pip install git+https://github.com/OSOceanAcoustics/echopype.git@plot

# Importing Packages
import matplotlib.pyplot as plt
import urllib.request
import shutil
import xarray as xr
import numpy as np
from pandas.testing import assert_frame_equal
from echopype.visualize.cm import cmap_d

import echoregions as er

Regions Data Reading#

To start this tutorial, we first download evr data from Echoregions’ Github Repository and parse the .evr file using Echoregions’ read_evr function.

The parsing is based off of the .evr data description shown on Echoview’s website: Region Attributes.

# Set path to test data
TEST_DATA_PATH = 'https://raw.githubusercontent.com/OSOceanAcoustics/echoregions/contains_transect_zip/echoregions/test_data'

# Download example EVR File
urllib.request.urlretrieve(f"{TEST_DATA_PATH}/transect.evr","transect.evr")

# Read EVR file
regions2d = er.read_evr('transect.evr')

Regions2D as a DataFrame#

regions2d is a specialized object but it has a data attribute which is a simple dataframe.

# Grab regions2d dataframe
regions2d_df = regions2d.data

regions2d_df.head(3)

	file_name	file_type	evr_file_format_number	echoview_version	region_id	region_structure_version	region_point_count	region_creation_type	dummy	...	region_bbox_right	region_bbox_top	region_bbox_bottom	region_class	region_type	region_name	time	depth	region_notes	region_detection_settings
0	transect.evr	EVRG	7	13.0.378.44817	1	13	4	6	-1	...	2019-07-02 08:10:09.425500	-9999.99	9999.99	Log	2	COM	[2019-07-02T03:50:54.629500000, 2019-07-02T03:...	[-9999.99, 9999.99, 9999.99, -9999.99]	[Switched from recording to "night" folder to ...	[]
1	transect.evr	EVRG	7	13.0.378.44817	2	13	4	6	-1	...	2019-07-02 12:32:31.740500	-9999.99	9999.99	Log	2	Com	[2019-07-02T12:32:30.175500000, 2019-07-02T12:...	[-9999.99, 9999.99, 9999.99, -9999.99]	[Recording 10 min of passive data]	[]
2	transect.evr	EVRG	7	13.0.378.44817	3	13	4	6	-1	...	2019-07-02 12:43:10.758500	-9999.99	9999.99	Log	2	COM	[2019-07-02T12:43:06.273000000, 2019-07-02T12:...	[-9999.99, 9999.99, 9999.99, -9999.99]	[End passive data collection and back to active]	[]

3 rows × 22 columns

The regions2d object can be subsetted using the select_region function and with parameters related to region class, time, and depth. For this example let us select a trawl region based on the region_class parameter:

trawl_regions = regions2d.select_region(region_class="Trawl")

trawl_regions

	file_name	file_type	evr_file_format_number	echoview_version	region_id	region_structure_version	region_point_count	region_selected	region_creation_type	dummy	...	region_bbox_right	region_bbox_top	region_bbox_bottom	region_class	region_type	region_name	time	depth	region_notes	region_detection_settings
17	transect.evr	EVRG	7	13.0.378.44817	18	13	4	0	4	-1	...	2019-07-02 19:52:21.531	9.244758	758.973217	Trawl	0	AWT20	[2019-07-02T18:40:51.809700000, 2019-07-02T18:...	[200.0, 300.0, 300.0, 200.0]	[]	[]
18	transect.evr	EVRG	7	13.0.378.44817	19	13	4	0	4	-1	...	2019-07-02 19:52:21.531	9.244758	758.973217	Trawl	0	AWT20	[2019-07-02T19:22:21.531000000, 2019-07-02T19:...	[220.0, 350.0, 350.0, 220.0]	[]	[]

2 rows × 22 columns

Now notice that these regions are not closed:

for _, point in trawl_regions.iterrows():
    plt.plot(point["time"], point["depth"])

_images/0094ca8d84a3e4d193ba7f2f2ae1ed5d5963726e93d2e2ac78deabb4bd50692b.png

We can close these regions and re-plot:

trawl_regions_closed = regions2d.close_region(region_class="Trawl")

for _, row in trawl_regions_closed.iterrows():
    plt.plot(row["time"], row["depth"])

_images/90392fc53bad52a5841b0390e39dbec5df19e18c01116cd0c0b622886479f691.png

To select Trawl regions, close regions, and plot regions, one can also just run the following using the object’s plot function:

regions2d.plot(region_class="Trawl", close_regions=True)

Plotting Echogram and Region#

From the two previous plots, one can kind of see how they’re related on both the depth and time dimensions. Now let’s see a region annotation overlayed on top of the Echogram dataset.

# Plotting the echogram data and the trawl region
plt.figure(figsize=(20, 6))
for _, point in trawl_regions_closed.iterrows():
    color = 'red' if point['region_id'] == 18 else 'black'  # Assign red for region ID 18 and black for region ID 19
    plt.plot(point["time"], point["depth"], fillstyle='full', markersize=1, color=color)

ds_Sv.Sv.isel(channel=1).T.plot(y="depth", yincrease=False, vmin=-70, vmax=-30, cmap=cmap_d["ek500"])

<matplotlib.collections.QuadMesh at 0x7f096f846770>

_images/9957a0c96155164d1a6d2bac659dd22d9490ea1b5a4342873b026f1894fb830f.png

Saving to “.csv” and Reading From “.csv”#

So now that we have our mask and our region polygon points, how do we save them?

We can use the Echoregions read_regions_csv function to first load it onto a region2d object and use the region2d object’s to_csv function to save the regions2d dataframe as a .csv.

# Create new regions2d object
from_mask_regions = er.read_regions_csv(region_points)

# Save to .csv
from_mask_regions.to_csv("from_mask_regions.csv", index=False)

Now if you need to load this .csv into a regions object we can again use read_regions_csv since it takes in both file locations (Path/str objects) and Pandas DataFrames:

# Create another new regions2d object
from_csv_regions = er.read_regions_csv("from_mask_regions.csv")

As another sanity check, let’s check if these dataframes are equal (with index reset):

try:
    assert_frame_equal(from_mask_regions.data.reset_index(drop=True), from_csv_regions.data.reset_index(drop=True))
    print("The two DataFrames are equal.")
except AssertionError:
    print("The two DataFrames are not equal.")

The two DataFrames are equal.