THz generation using a reflective stair-step echelon
We present a novel method for THz generation in lithium niobate using a
reflective stair-step echelon structure. The echelon produces a discretely
tilted pulse front with less angular dispersion compared to a high
groove-density grating. The THz output was characterized using both a 1-lens
and 3-lens imaging system to set the tilt angle at room and cryogenic
temperatures. Using broadband 800 nm pulses with a pulse energy of 0.95 mJ and
a pulse duration of 70 fs (24 nm FWHM bandwidth, 39 fs transform limited
width), we produced THz pulses with field strengths as high as 500 kV/cm and
pulse energies as high as 3.1 μJ. The highest conversion efficiency we
obtained was 0.33%. In addition, we find that the echelon is easily implemented
into an experimental setup for quick alignment and optimization.
Comment: 19 pages, 4 figure
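The figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is not taken from the paper: it assumes a Gaussian pulse shape (time-bandwidth product of about 0.441) and reads the THz pulse energy as 3.1 μJ, which is the value consistent with the stated 0.95 mJ pump energy and 0.33% efficiency. The function names are illustrative only.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s

# Time-bandwidth product of a transform-limited Gaussian pulse (FWHM values).
# Assumption: the abstract does not state the pulse shape; Gaussian is a guess.
GAUSSIAN_TBP = 2 * math.log(2) / math.pi  # about 0.441


def transform_limited_fwhm(center_wavelength_m, bandwidth_fwhm_m, tbp=GAUSSIAN_TBP):
    """Shortest pulse duration (s) supported by a given spectral FWHM."""
    # Convert the wavelength bandwidth to a frequency bandwidth.
    delta_nu = C * bandwidth_fwhm_m / center_wavelength_m ** 2
    return tbp / delta_nu


def conversion_efficiency(thz_energy_j, pump_energy_j):
    """Ratio of THz pulse energy to optical pump pulse energy."""
    return thz_energy_j / pump_energy_j


# Values quoted in the abstract: 800 nm centre, 24 nm FWHM, 0.95 mJ pump.
tl = transform_limited_fwhm(800e-9, 24e-9)
# Assumption: THz pulse energy of 3.1 microjoules.
eff = conversion_efficiency(3.1e-6, 0.95e-3)

print(f"transform-limited width: {tl * 1e15:.1f} fs")
print(f"conversion efficiency:   {eff * 100:.2f} %")
```

Under these assumptions the computed width lands close to the quoted 39 fs and the efficiency close to the quoted 0.33%, so the reported numbers are mutually consistent.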
Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR
We propose a method of segmenting long-form speech by separating semantically
complete sentences within the utterance. This prevents the ASR decoder from
needlessly processing faraway context while also preventing it from missing
relevant context within the current sentence. Semantically complete sentence
boundaries are typically demarcated by punctuation in written text; but
unfortunately, spoken real-world utterances rarely contain punctuation. We
address this limitation by distilling punctuation knowledge from a
bidirectional teacher language model (LM) trained on written, punctuated text.
We compare our segmenter, which is distilled from the LM teacher, against a
segmenter distilled from an acoustic-pause-based teacher used in other works, on
a streaming ASR pipeline. The pipeline with our segmenter achieves a 3.2%
relative WER gain along with a 60 ms median end-of-segment latency reduction on
a YouTube captioning task.
Comment: Interspeech 2023. First 3 authors contributed equall