[sw-l] Broadcast corpus
Steve Slevinski
slevin at signpuddle.net
Sat Jun 28 22:03:55 EDT 2008
Hi Hasna,
Hasna Hocini wrote:
> what I need to know is how I can identify in the SWML file the end of
> a sentence?
In SWML, punctuation is handled the same as other symbols. To find the
end of a sentence, you'd have to find the punctuation that ends a
sentence. Most often it would be alone, but I have seen people put
punctuation with other signs for specific placement.
In the IMWA, the punctuation symbol IDs start with "08-04". This
punctuation includes commas, so you'll probably want a specific list of
BaseSymbols. Val, can you help us out here?
Regards,
-Steve
More information about the SW-L
mailing list