1,805 research outputs found
The effect of peculiar velocities on the epoch of reionization (EoR) 21-cm signal
We have used semi-numerical simulations of reionization to study the
behaviour of the power spectrum of the EoR 21-cm signal in redshift space. We
have considered two models of reionization, one which has homogeneous
recombination (HR) and the other incorporating inhomogeneous recombination
(IR). We have estimated the observable quantities --- quadrupole and monopole
moments of HI power spectrum at redshift space from our simulated data. We find
that the magnitude and nature of the ratio between the quadrupole and monopole
moments of the power spectrum () can be a possible probe for the
epoch of reionization. We observe that this ratio becomes negative at large
scales for irrespective of the reionization model, which is a
direct signature of an inside-out reionization at large scales. It is possible
to qualitatively interpret the results of the simulations in terms of the
fluctuations in the matter distribution and the fluctuations in the neutral
fraction which have power spectra and cross-correlation ,
and respectively. We find that at large scales
the fluctuations in matter density and neutral fraction is exactly
anti-correlated through all stages of reionization. This provides a simple
picture where we are able to qualitatively interpret the behaviour of the
redshift space power spectra at large scales with varying entirely in
terms of a just two quantities, namely and the ratio . The nature of becomes different for HR and IR
scenarios at intermediate and small scales. We further find that it is possible
to distinguish between an inside-out and an outside-in reionization scenario
from the nature of the ratio at intermediate length scales.Comment: 11 pages, 6 figures. Accepted for publication in MNRAS. Replaced to
match the accepted version. Added one appendix to quantify the possible
uncertainties in the estimation of the multipole moments of redshift space
power spectru
Detecting ionized bubbles in redshifted 21 cm maps
The reionization of the Universe, it is believed, occurred by the growth of
ionized regions (bubbles) in the neutral intergalactic medium (IGM). We study
the possibility of detecting these bubbles in radio-interferometric
observations of redshifted neutral hydrogen (HI) 21 cm radiation. The signal 1
mJy will be buried in noise and foregrounds, the latter being at least a few
orders of magnitude stronger than the signal. We develop a visibility based
formalism that uses a filter to optimally combine the entire signal from a
bubble while minimizing the noise and foreground contributions. This formalism
makes definite predictions on the ability to detect an ionized bubble or
conclusively rule out its presence in a radio- interferometric observation. We
make predictions for the currently functioning GMRT and a forthcoming
instrument, the MWA at a frequency of 150 MHz (corresponding to a redshift of
8.5). For both instruments, we show that a 3 sigma detection will be possible
for a bubble of comoving radius R_b > 40 Mpc (assuming it to be spherical) in
100 hrs of observation and R_b 22 Mpc in 1000 hrs of observation, provided the
bubble is at the center of the field of view. In both these cases the filter
effectively removes the expected foreground contribution so that it is below
the signal, and the system noise is the deciding criteria. We find that there
is a fundamental limitation on the smallest bubble that can be detected arising
from the statistical fluctuations in the HI distribution. Assuming that the HI
traces the dark matter we find that it will not be possible to detect bubbles
with R_b < 8 Mpc using the GMRT and R_b < 16 Mpc using the MWA, however large
be the integration time.Comment: 11 pages, 10 figures, 1 table. Accepted for Publication in MNRAS.
Revised to match the accepted versio
Constraining Quasar and IGM Properties Through Bubble Detection in Redshifted 21-cm Maps
The infrared detection of a z>7 quasar has opened up a new window to directly
probe the IGM during the epoch of reionization. In this paper we theoretically
consider the possibility of detecting the ionized bubble around a z=8 quasar
using targeted redshifted 21-cm observations with the GMRT. The apparent shape
and size of the ionized bubble, as seen by a distant observer, depends on the
parameters \dot{N}_{phs}/C, x_HI/C and \tau_Q where \dot{N}_{phs}, \tau_Q, x_HI
and C are respectively the photon emission rate, age of the quasar, the neutral
fraction and clumping factor of the IGM.Here we have analytically estimated the
shape and size of a quasar's ionized bubble assuming an uniform IGM and
ignoring other ionizing sources besides the quasar, and used this as a template
for matched filter bubble search with the GMRT visibility data. We have assumed
that \dot{N}_{phs} is known from the infrared spectrum and C from theoretical
considerations, which gives us two free parameters x_HI and \tau_Q for bubble.
Considering 1,000 hr of observation, we find that there is a reasonably large
region of parameter space where a 3\sigma detection is possible. We also find
that it will be possible to place lower limits on x_HI and \tau_Q with this
observation. Deeper follow up observations can place upper limits on \tau_Q and
x_HI. Value of C affect the estimation of x_HI but the estimation of \tau_Q
remains unaffected.We have used a semi-numerical technique to simulate the
apparent shape and size of quasar ionized bubbles considering the presence of
other ionizing sources and inhomogeneities in the IGM. The presence of other
sources increase the size of the quasar bubble, leading to underestimation of
x_HI. Clustering of other ionizing sources around the quasar can produce severe
distortions in bubble's shape. However, this does not severely affect parameter
estimation in the bubbles that are large.Comment: 18 pages, 16 figures, 3 tables. Minor change in text. Accepted for
publication in MNRA
MSIR@FIRE: A Comprehensive Report from 2013 to 2016
[EN] India is a nation of geographical and cultural diversity where over 1600 dialects are spoken by the people. With the technological advancement, penetration of the internet and cheaper access to mobile data, India has recently seen a sudden growth
of internet users. These Indian internet users generate contents either in English or in other vernacular Indian languages.
To develop technological solutions for the contents generated by the Indian users using the Indian languages, the Forum
for Information Retrieval Evaluation (FIRE) was established and held for the first time in 2008. Although Indian languages
are written using indigenous scripts, often websites and user-generated content (such as tweets and blogs) in these Indian
languages are written using Roman script due to various socio-cultural and technological reasons. A challenge that search
engines face while processing transliterated queries and documents is that of extensive spelling variation. MSIR track was
first introduced in 2013 at FIRE and the aim of MSIR was to systematically formalize several research problems that one must
solve to tackle the code mixing in Web search for users of many languages around the world, develop related data sets, test
benches and most importantly, build a research community focusing on this important problem that has received very little attention. This document is a comprehensive report on the 4 years of MSIR track evaluated at FIRE between 2013 and 2016.Somnath Banerjee and Sudip Kumar Naskar are supported by Media Lab Asia, MeitY, Government of India, under the Visvesvaraya PhD Scheme for Electronics & IT. The work of Paolo Rosso was partially supported by the MISMIS research project PGC2018-096212-B-C31 funded by the Spanish MICINN.Banerjee, S.; Choudhury, M.; Chakma, K.; Kumar Naskar, S.; Das, A.; Bandyopadhyay, S.; Rosso, P. (2020). MSIR@FIRE: A Comprehensive Report from 2013 to 2016. SN Computer Science. 1(55):1-15. https://doi.org/10.1007/s42979-019-0058-0S115155Ahmed UZ, Bali K, Choudhury M, Sowmya VB. Challenges in designing input method editors for Indian languages: the role of word-origin and context. In: Advances in text input methods (WTIM 2011). 2011. pp. 1–9Banerjee S, Chakma K, Naskar SK, Das A, Rosso P, Bandyopadhyay S, Choudhury M. Overview of the mixed script information retrieval (MSIR) at fire-2016. In: Forum for information retrieval evaluation. Springer; 2016. pp. 39–49.Banerjee S, Kuila A, Roy A, Naskar SK, Rosso P, Bandyopadhyay S. A hybrid approach for transliterated word-level language identification: CRF with post-processing heuristics. In: Proceedings of the forum for information retrieval evaluation, ACM, 2014. pp. 54–59.Banerjee S, Naskar S, Rosso P, Bandyopadhyay S. Code mixed cross script factoid question classification—a deep learning approach. J Intell Fuzzy Syst. 2018;34(5):2959–69.Banerjee S, Naskar SK, Rosso P, Bandyopadhyay S. The first cross-script code-mixed question answering corpus. In: Proceedings of the workshop on modeling, learning and mining for cross/multilinguality (MultiLingMine 2016), co-located with the 38th European Conference on Information Retrieval (ECIR). 2016.Banerjee S, Naskar SK, Rosso P, Bandyopadhyay S. Named entity recognition on code-mixed cross-script social media content. Comput Sistemas. 2017;21(4):681–92.Barman U, Das A, Wagner J, Foster J. Code mixing: a challenge for language identification in the language of social media. In: Proceedings of the first workshop on computational approaches to code switching. 2014. pp. 13–23.Bhardwaj P, Pakray P, Bajpeyee V, Taneja A. Information retrieval on code-mixed Hindi–English tweets. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. 2016.Bhargava R, Khandelwal S, Bhatia A, Sharmai Y. Modeling classifier for code mixed cross script questions. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. CEUR-WS.org. 2016.Bhattacharjee D, Bhattacharya, P. Ensemble classifier based approach for code-mixed cross-script question classification. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. CEUR-WS.org. 2016.Chakma K, Das A. CMIR: a corpus for evaluation of code mixed information retrieval of Hindi–English tweets. In: The 17th international conference on intelligent text processing and computational linguistics (CICLING). 2016.Choudhury M, Chittaranjan G, Gupta P, Das A. Overview of fire 2014 track on transliterated search. Proceedings of FIRE. 2014. pp. 68–89.Ganguly D, Pal S, Jones GJ. Dcu@fire-2014: fuzzy queries with rule-based normalization for mixed script information retrieval. In: Proceedings of the forum for information retrieval evaluation, ACM, 2014. pp. 80–85.Gella S, Sharma J, Bali K. Query word labeling and back transliteration for Indian languages: shared task system description. FIRE Working Notes. 2013;3.Gupta DK, Kumar S, Ekbal A. Machine learning approach for language identification and transliteration. In: Proceedings of the forum for information retrieval evaluation, ACM, 2014. pp. 60–64.Gupta P, Bali K, Banchs RE, Choudhury M, Rosso P. Query expansion for mixed-script information retrieval. In: Proceedings of the 37th international ACM SIGIR conference on research and development in information retrieval, ACM, 2014. pp. 677–686.Gupta P, Rosso P, Banchs RE. Encoding transliteration variation through dimensionality reduction: fire shared task on transliterated search. In: Fifth forum for information retrieval evaluation. 2013.HB Barathi Ganesh, M Anand Kumar, KP Soman. Distributional semantic representation for information retrieval. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. 2016.HB Barathi Ganesh, M Anand Kumar, KP Soman. Distributional semantic representation for text classification. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. CEUR-WS.org. 2016.Järvelin K, Kekäläinen J. Cumulated gain-based evaluation of IR techniques. ACM Trans Inf Syst. 2002;20:422–46. https://doi.org/10.1145/582415.582418.Joshi H, Bhatt A, Patel H. Transliterated search using syllabification approach. In: Forum for information retrieval evaluation. 2013.King B, Abney S. Labeling the languages of words in mixed-language documents using weakly supervised methods. In: Proceedings of NAACL-HLT, 2013. pp. 1110–1119.Londhe N, Srihari RK. Exploiting named entity mentions towards code mixed IR: working notes for the UB system submission for MSIR@FIRE’16. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. 2016.Anand Kumar M, Soman KP. Amrita-CEN@MSIR-FIRE2016: Code-mixed question classification using BoWs and RNN embeddings. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. CEUR-WS.org. 2016.Majumder G, Pakray P. NLP-NITMZ@MSIR 2016 system for code-mixed cross-script question classification. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. CEUR-WS.org. 2016.Mandal S, Banerjee S, Naskar SK, Rosso P, Bandyopadhyay S. Adaptive voting in multiple classifier systems for word level language identification. In: FIRE workshops, 2015. pp. 47–50.Mukherjee A, Ravi A , Datta K. Mixed-script query labelling using supervised learning and ad hoc retrieval using sub word indexing. In: Proceedings of the Forum for Information Retrieval Evaluation, Bangalore, India, 2014.Pakray P, Bhaskar P. Transliterated search system for Indian languages. In: Pre-proceedings of the 5th FIRE-2013 workshop, forum for information retrieval evaluation (FIRE). 2013.Patel S, Desai V. Liga and syllabification approach for language identification and back transliteration: a shared task report by da-iict. In: Proceedings of the forum for information retrieval evaluation, ACM, 2014. pp. 43–47.Prabhakar DK, Pal S. Ism@fire-2013 shared task on transliterated search. In: Post-Proceedings of the 4th and 5th workshops of the forum for information retrieval evaluation, ACM, 2013. p. 17.Prabhakar DK, Pal S. Ism@ fire-2015: mixed script information retrieval. In: FIRE workshops. 2015. pp. 55–58.Prakash A, Saha SK. A relevance feedback based approach for mixed script transliterated text search: shared task report by bit Mesra. In: Proceedings of the Forum for Information Retrieval Evaluation, Bangalore, India, 2014.Raj A, Karfa S. A list-searching based approach for language identification in bilingual text: shared task report by asterisk. In: Working notes of the shared task on transliterated search at forum for information retrieval evaluation FIRE’14. 2014.Roy RS, Choudhury M, Majumder P, Agarwal K. Overview of the fire 2013 track on transliterated search. In: Post-proceedings of the 4th and 5th workshops of the forum for information retrieval evaluation, ACM, 2013. p. 4.Saini A. Code mixed cross script question classification. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. CEUR-WS.org. 2016.Salton G, McGill MJ. Introduction to modern information retrieval. New York: McGraw-Hill, Inc.; 1986.Sequiera R, Choudhury M, Gupta P, Rosso P, Kumar S, Banerjee S, Naskar SK, Bandyopadhyay S, Chittaranjan G, Das A, et al. Overview of fire-2015 shared task on mixed script information retrieval. FIRE Workshops. 2015;1587:19–25.Singh S, M Anand Kumar, KP Soman. CEN@Amrita: information retrieval on code mixed Hindi–English tweets using vector space models. In: Working notes of FIRE 2016—forum for information retrieval evaluation, Kolkata, India, December 7–10, 2016, CEUR workshop proceedings. 2016.Sinha N, Srinivasa G. Hindi–English language identification, named entity recognition and back transliteration: shared task system description. In: Working notes os shared task on transliterated search at forum for information retrieval evaluation FIRE’14. 2014.Voorhees EM, Tice DM. The TREC-8 question answering track evaluation. In: TREC-8, 1999. pp. 83–105.Vyas Y, Gella S, Sharma J, Bali K, Choudhury M. Pos tagging of English–Hindi code-mixed social media content. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). 2014. pp. 974–979
Detailed study of the ELAIS N1 field with the uGMRT - I. Characterizing the 325 MHz foreground for redshifted 21 cm observations
In this first paper of the series, we present initial results of newly
upgraded Giant Meterwave Radio Telescope (uGMRT) observation of European
Large-Area ISO Survey-North 1 (ELAIS-N1) at 325 MHz with 32 MHz bandwidth.
Precise measurement of fluctuations in Galactic and extragalactic foreground
emission as a function of frequency as well as angular scale is necessary for
detecting redshifted 21-cm signal of neutral hydrogen from Cosmic Dawn, Epoch
of Reionization (EoR) and post-reionization epoch. Here, for the first time we
have statistically quantified the Galactic and extragalactic foreground sources
in the ELAIS-N1 field in the form of angular power spectrum using the newly
developed Tapered Gridded Estimator (TGE). We have calibrated the data with and
without direction-dependent calibration techniques. We have demonstrated the
effectiveness of TGE against the direction dependent effects by using higher
tapering of field of view (FoV). We have found that diffuse Galactic
synchrotron emission (DGSE) dominates the sky, after point source subtraction,
across the angular multipole range and for
direction-dependent and -independent calibrated visibilities respectively. The
statistical fluctuations in DGSE has been quantified as a power law of the form
. The best fitted
values of (A, ) are ( , ) and ( , ) for the two different calibration
approaches. For both the cases, the power law index is consistent with the
previous measurements of DGSE in other parts of sky.Comment: 13 pages, 5figures, 4 tables; accepted for publication in MNRA
Towards 21-cm Intensity Mapping at with uGMRT using the Tapered Gridded Estimator I: Foreground Avoidance
The post-reionization neutral hydrogen (HI) 21-cm intensity
mapping signal holds the potential to probe the large scale structures, study
the expansion history and constrain various cosmological parameters. Here we
apply the Tapered Gridded Estimator (TGE) to estimate
the power spectrum of the redshifted 21-cm signal using a sub-band drawn
from uGMRT Band 3 observations of European Large-Area ISO Survey-North 1
(ELAIS-N1). The TGE allows us to taper the sky response which suppresses the
foreground contribution from sources in the periphery of the telescope's field
of view. We apply the TGE on the measured visibility data to estimate the
multi-frequency angular power spectrum (MAPS) from which
we determine using maximum-likelihood which
naturally overcomes the issue of missing frequency channels (55 \% here). The
entire methodology is validated using simulations. For the data, using the
foreground avoidance technique, we obtain a upper limit of
for the 21-cm brightness
temperature fluctuation at . This corresponds
to , where and respectively denote the cosmic \HI mass density and the \HI bias
parameter. A previous work has analyzed of the same data at
, and reported and
at . The upper
limits presented here are still orders of magnitude larger than the expected
signal corresponding to and .Comment: 13 pages, 11 figures, accepted for publication in MNRA
Towards -cm intensity mapping at with uGMRT using the tapered gridded estimator III: Foreground removal
Neutral hydrogen (\ion{H}{i}) -cm intensity mapping (IM) is a promising
probe of the large-scale structures in the Universe. However, a few orders of
magnitude brighter foregrounds obscure the IM signal. Here we use the Tapered
Gridded Estimator (TGE) to estimate the multi-frequency angular power spectrum
(MAPS) from a bandwidth uGMRT Band
data at . In foregrounds remain
correlated across the entire range, whereas the -cm signal is
localized within (typically ).
Assuming the range to have minimal -cm signal, we
use in this range to model the foregrounds. This
foreground model is extrapolated to , and subtracted
from the measured . The residual
in the range is
used to constrain the -cm signal, compensating for the signal loss from
foreground subtraction. is found to be
noise-dominated without any trace of foregrounds. Using
we constrain the -cm brightness
temperature fluctuations , and obtain the upper limit
at . We
further obtain the upper limit
[\Omega_{\ion{H}{i}}b_{\ion{H}{i}}]_{\rm UL}\leq0.022 where
\Omega_{\ion{H}{i}} and b_{\ion{H}{i}} are the comoving \ion{H}{i} density
and bias parameters respectively. Although the upper limit is nearly times
larger than the expected -cm signal, it is times tighter over previous
works using foreground avoidance on the same data.Comment: Accepted for publication in MNRAS. 16 pages (including Appendix), 8
figures (plus 8 in Appendix), 5 Table
Towards -cm intensity mapping at with uGMRT using the tapered gridded estimator -- IV. Wideband analysis
We present a Wideband Tapered Gridded Estimator (TGE), which incorporates
baseline migration and variation of the primary beam pattern for neutral
hydrogen () 21-cm intensity mapping
(IM) with large frequency bandwidth radio-interferometric observations. Here we
have analysed uGMRT data to estimate
the Multi-frequency Angular Power Spectrum (MAPS) from
which we have removed the foregrounds using the polynomial fitting (PF) and
Gaussian Process Regression (GPR) methods developed in our earlier work. Using
the residual to estimate the mean squared 21-cm brightness
temperature fluctuation , we find that this is consistent with in several bins. The resulting upper limit
at is nearly
times tighter than earlier limits obtained from a smaller bandwidth () of the same data. The upper limit is within an order of magnitude of the value
expected from independent estimates of the mass density
and the bias . The techniques used here can be
applied to other telescopes and frequencies, including
Epoch of Reionization observations.Comment: Accepted for publication in MNRA
- …