Exploring Echoregions Lines Functionality


This notebook parses bottom values from an Echoview .evl file and creates a bottom mask for the corresponding Echogram data.

Installation

Before running this notebook or any of the other notebooks, install Echoregions and Echopype (with its plotting extras) using pip.

Install Using PyPI:

pip install echoregions

pip install echopype[plot]

Install Using Latest GitHub Main Branch Commit:

pip install git+https://github.com/OSOceanAcoustics/echoregions.git

pip install git+https://github.com/OSOceanAcoustics/echopype.git@plot
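After installing, it can help to confirm which versions ended up in the environment. The snippet below is a minimal sketch using only the standard library; the helper name installed_version is hypothetical and is not part of Echoregions or Echopype.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string of a package, or None if it is missing."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


# Report the packages this notebook depends on
for name in ("echoregions", "echopype"):
    print(name, installed_version(name) or "not installed")
```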

# Importing Packages
import matplotlib.pyplot as plt
import urllib.request
import shutil
import xarray as xr
import numpy as np
import pandas as pd
from pandas.testing import assert_frame_equal
from echopype.visualize.cm import cmap_d

import echoregions as er

Bottom Data Reading

To start this tutorial, we download an example .evl file from Echoregions' GitHub repository and parse it with Echoregions' read_evl function.

The parsing is based on the .evl file description on Echoview's website: Line Attributes.

# Set path to test data
TEST_DATA_PATH = 'https://raw.githubusercontent.com/OSOceanAcoustics/echoregions/contains_transect_zip/echoregions/test_data'

# Download example EVL File
urllib.request.urlretrieve(f"{TEST_DATA_PATH}/transect.evl","transect.evl")

# Read EVL file
lines = er.read_evl('transect.evl')
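To give a feel for what the parser has to handle, here is a minimal sketch of reading line-file text by hand. It is not Echoregions' implementation. It assumes the layout is a header line (file type, format version, Echoview version), a point count, and then one "CCYYMMDD HHmmSSssss depth status" row per point; check Echoview's Line Attributes page before relying on this. The function name parse_evl_text is hypothetical, and the sample rows are reconstructed from the first two rows of the table shown later in this notebook under that assumed layout.

```python
from datetime import datetime


def parse_evl_text(text):
    """Parse EVL-like text into metadata and a list of point dicts (assumed layout)."""
    rows = [line.split() for line in text.splitlines() if line.strip()]
    file_type, file_format_version, echoview_version = rows[0]
    n_points = int(rows[1][0])
    points = []
    for date, time, depth, status in rows[2 : 2 + n_points]:
        points.append(
            {
                # %f reads the trailing fractional-second digits
                "time": datetime.strptime(f"{date} {time}", "%Y%m%d %H%M%S%f"),
                "depth": float(depth),
                "status": status,  # kept as a string
            }
        )
    return {
        "file_type": file_type,
        "evl_file_format_version": file_format_version,
        "echoview_version": echoview_version,
        "points": points,
    }


# Sample reconstructed under the assumed layout (not copied from transect.evl)
sample = """EVBD 3 13.0.378.44817
2
20190702 1839413210 442.996834 3
20190702 1839426790 437.818405 3
"""
parsed = parse_evl_text(sample)
print(parsed["points"][0])
```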

Lines as a DataFrame

lines is a specialized object, but its data attribute is a plain pandas DataFrame.

# Grab lines dataframe
lines_df = lines.data

lines_df

	file_name	file_type	evl_file_format_version	echoview_version	time	depth	status
0	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 18:39:41.321000	442.996834	3
1	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 18:39:42.679000	437.818405	3
2	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 18:39:44.031000	445.194735	1
3	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 18:39:45.380000	451.168987	3
4	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 18:39:46.728000	442.551006	3
...	...	...	...	...	...	...	...
3166	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 21:04:47.146000	760.707803	3
3167	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 21:04:47.147000	762.196532	3
3168	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 21:10:40.095000	766.613696	3
3169	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 21:10:40.096000	763.976879	3
3170	transect.evl	EVBD	3	13.0.378.44817	2019-07-02 21:17:22.145000	764.867052	0

3171 rows × 7 columns

Note the rightmost column, status. Status values are generally described as follows:

0 = none

1 = unverified

2 = bad

3 = good

The good and bad values are assigned by the line-picking algorithm used to generate the original EVL file. Generally, we only want the rows with good (3) status.

More information on Echoview Status can be found here: Line Status.
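Before filtering, it can be useful to see how many points fall into each status. The following is a minimal sketch on a small stand-in DataFrame rather than the real lines_df. The names STATUS_LABELS and summarize_status are hypothetical, and the status values are assumed to be stored as strings, as the string comparison used later in this notebook suggests.

```python
import pandas as pd

# Labels taken from the status list above
STATUS_LABELS = {"0": "none", "1": "unverified", "2": "bad", "3": "good"}


def summarize_status(df):
    """Count rows per status label; works for string or integer status values."""
    labels = df["status"].astype(str).map(STATUS_LABELS)
    return labels.value_counts().to_dict()


# Stand-in data with the same column names as the lines DataFrame
example = pd.DataFrame(
    {
        "depth": [442.996834, 437.818405, 445.194735, 764.867052],
        "status": ["3", "3", "1", "0"],
    }
)
summary = summarize_status(example)
print(summary)
```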

Let’s now plot the good points.

# Status 3 marks good points; note that the comparison below is against the string '3'
good_lines_df = lines_df[lines_df['status'] == '3']
good_bottom = good_lines_df[['time', 'depth']]

plt.plot(good_bottom['time'], good_bottom['depth'], 'black')
plt.xlabel('Ping Time')
plt.ylabel('Depth')
plt.gca().invert_yaxis()

plt.show()

[Figure: depth of the good-status bottom points versus ping time, with the depth axis inverted]

For later use, set this filtered DataFrame as the lines object's data:

lines.data = good_lines_df
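The filter above matches only if status is stored as the string '3'. If the column were ever stored as integers, the comparison would silently select nothing. A minimal sketch of a dtype-tolerant filter follows; select_status is a hypothetical helper, not an Echoregions function, and it returns a copy so that later edits do not touch the original DataFrame.

```python
import pandas as pd


def select_status(df, status=3):
    """Return a copy of the rows whose status matches, for string or integer columns."""
    matches = df["status"].astype(str) == str(status)
    return df[matches].copy()


# The same stand-in data with status stored two different ways
as_strings = pd.DataFrame({"depth": [442.9, 445.1, 451.1], "status": ["3", "1", "3"]})
as_integers = as_strings.assign(status=[3, 1, 3])

good_from_strings = select_status(as_strings)
good_from_integers = select_status(as_integers)
print(len(good_from_strings), len(good_from_integers))
```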

Plotting Echogram and Bottom

The bottom line and the Echogram share the depth and time dimensions. Now let’s see the bottom annotations overlaid on the Echogram dataset.

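# Plot the Echogram data and the bottom line.
# ds_Sv is assumed to be an xarray Dataset of Sv data with a depth coordinate,
# loaded in an earlier step that is not shown in this section.
plt.figure(figsize=(20, 6))
plt.plot(lines.data['time'], lines.data['depth'], 'black', fillstyle='full', markersize=1)
ds_Sv.Sv.isel(channel=1).T.plot.pcolormesh(y="depth", yincrease=False, vmin=-70, vmax=-30, cmap=cmap_d["ek500"])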

<matplotlib.collections.QuadMesh at 0x7f18c378e110>

[Figure: Echogram (Sv, channel index 1) with the bottom line overlaid in black, depth increasing downward]
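The overlay hints at how a bottom mask can be built: interpolate the bottom depth onto each ping time, then flag every sample that lies above it. The sketch below illustrates that idea with NumPy on synthetic values. It is not Echoregions' masking implementation; the function name bottom_mask and the choice of linear interpolation are assumptions made for illustration.

```python
import numpy as np


def bottom_mask(ping_time, depth, line_time, line_depth):
    """Return a (ping, depth) boolean array that is True above the bottom line."""
    # Convert datetimes to seconds relative to the first ping for interpolation
    t0 = ping_time[0]
    ping_s = (ping_time - t0) / np.timedelta64(1, "s")
    line_s = (line_time - t0) / np.timedelta64(1, "s")
    # Linearly interpolate the bottom depth at each ping time
    bottom_at_ping = np.interp(ping_s, line_s, line_depth)
    # Compare every depth sample against the bottom depth of its ping
    return depth[np.newaxis, :] < bottom_at_ping[:, np.newaxis]


# Synthetic example: three pings, four depth samples, two bottom points
ping_time = np.array(
    ["2019-07-02T18:39:41", "2019-07-02T18:39:42", "2019-07-02T18:39:43"],
    dtype="datetime64[s]",
)
depth = np.array([5.0, 12.0, 18.0, 25.0])
line_time = np.array(["2019-07-02T18:39:41", "2019-07-02T18:39:43"], dtype="datetime64[s]")
line_depth = np.array([10.0, 20.0])

mask = bottom_mask(ping_time, depth, line_time, line_depth)
print(mask)
```

np.interp expects the line times to be sorted in increasing order, so a real line DataFrame would need to be sorted by time first.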

Saving to “.csv” and Reading From “.csv”

Now that we have our mask and our new interpolated bottom points, how do we save them?

We can use Echoregions' read_lines_csv function to load the points into a lines object, then use that object's to_csv function to save the lines DataFrame as a .csv file.

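# Create new lines object.
# bottom_points is assumed to be the DataFrame of interpolated bottom points
# produced in an earlier step that is not shown in this section.
from_mask_lines = er.read_lines_csv(bottom_points)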

# Save to .csv
from_mask_lines.to_csv("from_mask_lines.csv")

To load this .csv back into a lines object, we can again use read_lines_csv, since it accepts both file locations (Path/str objects) and pandas DataFrames:

# Create another new lines object
from_csv_lines = er.read_lines_csv("from_mask_lines.csv")
from_csv_lines.data = from_csv_lines.data.drop("Unnamed: 0", axis=1) # TODO: Fix this need to drop
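An "Unnamed: 0" column typically appears when a DataFrame is written to CSV with its index and then read back without declaring that index. The sketch below reproduces this with plain pandas on stand-in data. Whether Echoregions' to_csv writes the index in the same way is an assumption here, not something this notebook confirms.

```python
import io

import pandas as pd

# Stand-in for a lines DataFrame
points = pd.DataFrame({"depth": [442.996834, 437.818405], "status": [3, 3]})

# pandas writes the index by default, producing an unnamed first column
buffer = io.StringIO()
points.to_csv(buffer)
buffer.seek(0)
with_extra_column = pd.read_csv(buffer)

# Option 1: tell read_csv that the first column is the index
buffer.seek(0)
index_restored = pd.read_csv(buffer, index_col=0)

# Option 2: skip writing the index in the first place
no_index = io.StringIO()
points.to_csv(no_index, index=False)
no_index.seek(0)
clean = pd.read_csv(no_index)

print(list(with_extra_column.columns))
print(list(index_restored.columns))
print(list(clean.columns))
```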

Now let’s check if these dataframes are equal:

try:
    assert_frame_equal(from_mask_lines.data, from_csv_lines.data)
    print("The two DataFrames are equal.")
except AssertionError:
    print("The two DataFrames are not equal.")

The two DataFrames are equal.
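The try/except pattern above can be wrapped in a small helper when several comparisons are needed. This is a minimal sketch; frames_equal is a hypothetical name, and extra keyword arguments are passed straight through to pandas' assert_frame_equal (for example check_dtype=False to ignore dtype differences).

```python
import pandas as pd
from pandas.testing import assert_frame_equal


def frames_equal(left, right, **kwargs):
    """Return True if assert_frame_equal passes for the two DataFrames, else False."""
    try:
        assert_frame_equal(left, right, **kwargs)
    except AssertionError:
        return False
    return True


# Stand-in data
a = pd.DataFrame({"depth": [442.996834, 437.818405]})
b = a.copy()
c = pd.DataFrame({"depth": [442.996834, 500.0]})
ints = pd.DataFrame({"status": [3, 3]})
floats = ints.astype(float)

print(frames_equal(a, b))
print(frames_equal(a, c))
print(frames_equal(ints, floats))
print(frames_equal(ints, floats, check_dtype=False))
```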