Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,72 @@
|
|
1 |
-
---
|
2 |
-
license: cc-by-sa-4.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: cc-by-sa-4.0
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
base_model:
|
6 |
+
- patcdaniel/phytoClassUCSC
|
7 |
+
---
|
8 |
+
|
9 |
+
_This model and readme file are derivative of the [phytoClassUCSC](https://huggingface.co/patcdaniel/phytoClassUCSC) classifier._
|
10 |
+
|
11 |
+
## Model details
|
12 |
+
|
13 |
+
- A modified version of the [phytoClassUCSC](https://huggingface.co/patcdaniel/phytoClassUCSC) classifier, made to run on modern () tensorflow versions.
|
14 |
+
- Originally eveloped by the Kudela Lab from the Ocean Sciences Department at University of California, Santa Cruz.
|
15 |
+
- Current version trained in February, 2023.
|
16 |
+
- depthwise-CNN based on the Xception architecture [Chollet, F., 2017](https://arxiv.org/abs/1610.02357) with 134 layers using weights pretrained on ImageNet.
|
17 |
+
- An average pooling layer is used.
|
18 |
+
- Licensed under CC-BY-SA-4.0
|
19 |
+
- For questions on the original model email Patrick Daniel <[pcdaniel@ucsc.edu](pcdaniel@ucsc.edu)>
|
20 |
+
- For questions on the adaptions made to update the model email Alex Baldwin <[alewin@noc.ac.uk](alewin@noc.ac.uk)>
|
21 |
+
|
22 |
+
## Intended use
|
23 |
+
|
24 |
+
This model was designed and trained to work with IFCB data generated in Monterey Bay. While that does not mean it may not perform well in other locations, the distribution of training images reflects common phytoplankton observed at the Santa Cruz Wharf and Power Buoy locations. Independent model validation should be used when applying the model to other sites.
|
25 |
+
|
26 |
+
### Primary intended uses
|
27 |
+
|
28 |
+
Generalized micro-phytoplankton classifier for common taxa found in the Monterey Bay.
|
29 |
+
|
30 |
+
### Primary intended users
|
31 |
+
|
32 |
+
Researchers intersted in a general.
|
33 |
+
|
34 |
+
### Out-of-scope use cases
|
35 |
+
|
36 |
+
Observing and identifying rare or non-endemic taxa.
|
37 |
+
|
38 |
+
## Factors
|
39 |
+
|
40 |
+
Model classes were chosen based on common and resolvable phytoplankton taxa. Taxonomic groupings were chosen based on what researchers in the lab felt groups that could be confidently identified, given the expertise and research intersts of the lab.
|
41 |
+
|
42 |
+
### Instrument
|
43 |
+
|
44 |
+
Model was trained on images from Imaging FlowCytobot (IFCB) instruments primary deployed at the Santa Cruz Wharf and the Monterey Bay Aquarium Research Institute (MBARI) Power Buoy. The Santa Cruz Wharf IFCB (#104) is an early generation
|
45 |
+
|
46 |
+
## Metrics
|
47 |
+
|
48 |
+
Deployed model performance will vary with the natural variabilability in the observed phytoplankton communities over different time scales (seasonality). As such model performance should be evaluated throughout IFCb deployments using independently labled images.
|
49 |
+
|
50 |
+
### Model performance measures
|
51 |
+
|
52 |
+
Training model performace was evaluated using a held-back validation training set. F1-scores were calcuated for each class. [See Results here](https://stage-habdac-streamlit.srv.axds.co/Model_Metrics)
|
53 |
+
|
54 |
+
### Approaches to uncertainty and variability
|
55 |
+
|
56 |
+
Uncertainty is addressed by applying a set of class-specific thresholds for each prediction. This works reasonably well for out-of-distribution images.
|
57 |
+
|
58 |
+
## Training data
|
59 |
+
|
60 |
+
To Be Described
|
61 |
+
|
62 |
+
## Ethical considerations
|
63 |
+
|
64 |
+
None
|
65 |
+
|
66 |
+
## Caveats and recommendations
|
67 |
+
|
68 |
+
This model was developed as in interation of previous classification efforts and as such is subject to a history of decision making that is not captured here. For that reasons this classifier is not a panacea for all phytoplankton image data, but was specifically developed for looking at phytoplankton communities in Monterey Bay.
|
69 |
+
|
70 |
+
IFCB collected data are very context specific and subject to both observation configurations and small-scale variability.
|
71 |
+
|
72 |
+
Review section 4.9 of the [model cards paper](https://arxiv.org/abs/1810.03993).
|