Ozlem Kalinli

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

◀ ▶ joint publications with Jay Mahadeokar

> Home > Persons > Ozlem Kalinli

Publications

2025
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JiaKZL0WSMK25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JiaKZL0WSMK25
Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:
Efficient Streaming LLM for Speech Recognition. ICASSP 2025: 1-5
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RajKJMK25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RajKJMK25
Desh Raj, Gil Keren, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Faster Speech-LLaMA Inference with Multi-token Prediction. ICASSP 2025: 1-5
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YadavKRZJLXWMK25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YadavKRZJLXWMK25
Amit Kumar Singh Yadav, Gil Keren, Desh Raj, Wei Zhou, Junteng Jia, Ke Li, Ying Xu, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli:
Speech-N-LlaMA: Improving Speech LLMs with Multi-Pass Training. ICASSP 2025: 1-5
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangRLMJKLHDMK25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangRLMJKLHDMK25
Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. ICASSP 2025: 1-5
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouJSMK25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouJSMK25
Wei Zhou, Junteng Jia, Leda Sari, Jay Mahadeokar, Ozlem Kalinli:
CJST: CTC Compressor based Joint Speech and Text Training for Decoder-Only ASR. ICASSP 2025: 1-5
2024
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShangguanYLWFWD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShangguanYLWFWD24
Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models. ICASSP 2024: 10216-10220
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XieLGTSSWJMK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XieLGTSSWJMK24
Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of a Multilingual ASR Model. ICASSP 2024: 12201-12205
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuoMMSWMKFS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuoMMSWMKFS24
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective Internal Language Model Training and Fusion for Factorized Transducer Model. ICASSP 2024: 12687-12691
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FathullahWLJSLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FathullahWLJSLG24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/FathullahWLLJSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/FathullahWLLJSM24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-01716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-01716
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective internal language model training and fusion for factorized transducer model. CoRR abs/2404.01716 (2024)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-08148
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-08148
Desh Raj, Gil Keren, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Faster Speech-LLaMA Inference with Multi-token Prediction. CoRR abs/2409.08148 (2024)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-11494
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-11494
Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. CoRR abs/2409.11494 (2024)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-01162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01162
Wonjune Kang, Junteng Jia, Chunyang Wu, Wei Zhou, Egor Lakomkin, Yashesh Gaur, Leda Sari, Suyoun Kim, Ke Li, Jay Mahadeokar, Ozlem Kalinli:
Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech. CoRR abs/2410.01162 (2024)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-03752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-03752
Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:
Efficient Streaming LLM for Speech Recognition. CoRR abs/2410.03752 (2024)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-07607
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-07607
Wei Zhou, Junteng Jia, Leda Sari, Jay Mahadeokar, Ozlem Kalinli:
CJST: CTC Compressor based Joint Speech and Text Training for Decoder-Only ASR. CoRR abs/2411.07607 (2024)
2023
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/JiaLMMMKS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/JiaLMMMKS23
Junteng Jia, Ke Li, Mani Malek, Kshitiz Malik, Jay Mahadeokar, Ozlem Kalinli, Frank Seide:
Joint Federated Learning and Personalization for on-Device ASR. ASRU 2023: 1-8
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiMGSKKSL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMGSKKSL23
Ke Li, Jay Mahadeokar, Jinxi Guo, Yangyang Shi, Gil Keren, Ozlem Kalinli, Michael L. Seltzer, Duc Le:
Improving fast-slow Encoder based Transducer with Streaming Deliberation. ICASSP 2023: 1-5
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RajJMWMZK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RajJMWMZK23
Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. ICASSP 2023: 1-5
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FathullahWSJXML23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FathullahWSJXML23
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. INTERSPEECH 2023: 241-245
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12498
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12498
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. CoRR abs/2305.12498 (2023)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-00998
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00998
Shuo Liu, Leda Sari, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli:
Towards Selection of Text-to-speech Data to Augment ASR Training. CoRR abs/2306.00998 (2023)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-11795
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-11795
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-01947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-01947
Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models. CoRR abs/2309.01947 (2023)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-13018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13018
Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model. CoRR abs/2309.13018 (2023)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-06753
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-06753
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023)
2022
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiWWXMZLLSNKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiWWXMZLLSNKS22
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution. ICASSP 2022: 8277-8281
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiaMZSKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiaMZSKS22
Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide:
Federated Domain Adaptation for ASR with Full Self-Supervision. INTERSPEECH 2022: 536-540
[c28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MahadeokarSLLZC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MahadeokarSLLZC22
Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast slow cascaded encoders. INTERSPEECH 2022: 2083-2087
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15773
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15773
Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast-slow cascaded encoders. CoRR abs/2203.15773 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15966
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15966
Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide:
Federated Domain Adaptation for ASR with Full Self-Supervision. CoRR abs/2203.15966 (2022)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-11588
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11588
Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. CoRR abs/2210.11588 (2022)
2021
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeJKKSMCSFKSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeJKKSMCSFKSS21
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShiNWMLPXYCFKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiNWMLPXYCFKS21
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MahadeokarSSWXS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MahadeokarSSWXS21
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShangguanPSMSZW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShangguanPSMSZW21
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02176
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02194
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02194
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02207
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02207
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02232
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02232
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05241
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05241
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution. CoRR abs/2110.05241 (2021)

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.