In running text, we will normally not capture these at all, since we do not normally capture Hebrew. In rare circumstances, in English language context, we may need to capture these, usually as stand-alone (spacing) marks. The following marks have appeared in our books so far. Others will be added to this list as needed.
Browsers are inconsistent in honoring the zero-space and rtl properties of these marks, which makes their interaction with surrounding text in English context unpredictable. I suggest that if we need a spacing version, we should always surround each entity with a non-breaking space entity. In the one book in which we've met them, the printer combined them with a horizontal line in order to indicate whether they were a low or a high diacritic. In this book, I would suggest capturing the line+cantillation combination as a string consisting of mdash -- cantillation entity -- mdash.
| Unicode name | Other name(s) | entity name | NCR with mdashes | EXAMPLES |
|---|---|---|---|---|
| DARGA (U+05A7) | &darga; |
| ||
| ETNAHTA (U+0591) | &etnahta; |
| ||
| GERESH (U+059C) | &geresh; |
| ||
| ||||
| ||||
| ||||
| ||||
| ||||
| ||||
| ||||
| ||||
| ||||
|