15,493 research outputs found
Effective Blog Pages Extractor for Better UGC Accessing
Blog is becoming an increasingly popular media for information publishing.
Besides the main content, most of blog pages nowadays also contain noisy
information such as advertisements etc. Removing these unrelated elements can
improves user experience, but also can better adapt the content to various
devices such as mobile phones. Though template-based extractors are highly
accurate, they may incur expensive cost in that a large number of template need
to be developed and they will fail once the template is updated. To address
these issues, we present a novel template-independent content extractor for
blog pages. First, we convert a blog page into a DOM-Tree, where all elements
including the title and body blocks in a page correspond to subtrees. Then we
construct subtree candidate set for the title and the body blocks respectively,
and extract both spatial and content features for elements contained in the
subtree. SVM classifiers for the title and the body blocks are trained using
these features. Finally, the classifiers are used to extract the main content
from blog pages. We test our extractor on 2,250 blog pages crawled from nine
blog sites with obviously different styles and templates. Experimental results
verify the effectiveness of our extractor.Comment: 2016 3rd International Conference on Information Science and Control
Engineering (ICISCE
The Mechanisms of Electron Acceleration During Multiple X Line Magnetic Reconnection with a Guide Field
The interactions between magnetic islands are considered to play an important
role in electron acceleration during magnetic reconnection. In this paper,
two-dimensional (2-D) particle-in-cell (PIC) simulations are performed to study
electron acceleration during multiple X line reconnection with a guide field.
The electrons remain almost magnetized, and we can then analyze the
contributions of the parallel electric field, Fermi and betatron mechanisms to
electron acceleration during the evolution of magnetic reconnection by
comparing with a guide-center theory. The results show that with the proceeding
of magnetic reconnection, two magnetic islands are formed in the simulation
domain. The electrons are accelerated by both the parallel electric field in
the vicinity of the X lines and Fermi mechanism due to the contraction of the
two magnetic islands. Then the two magnetic islands begin to merge into one,
and in such a process electrons can be accelerated by the parallel electric
field and betatron mechanisms. During the betatron acceleration, the electrons
are locally accelerated in the regions where the magnetic field is piled up by
the high-speed flow from the X line. At last, when the coalescence of the two
islands into a big one finishes, electrons can further be accelerated by the
Fermi mechanism because of the contraction of the big island. With the increase
of the guide field, the contributions of Fermi and betatron mechanisms to
electron acceleration become less and less important. When the guide field is
sufficiently large, the contributions of Fermi and betatron mechanisms are
almost negligible.Comment: 22 pages, 5 figures, accepted by The Astrophysical Journa
Interference Channel with Intermittent Feedback
We investigate how to exploit intermittent feedback for interference
management. Focusing on the two-user linear deterministic interference channel,
we completely characterize the capacity region. We find that the
characterization only depends on the forward channel parameters and the
marginal probability distribution of each feedback link. The scheme we propose
makes use of block Markov encoding and quantize-map-and-forward at the
transmitters, and backward decoding at the receivers. Matching outer bounds are
derived based on novel genie-aided techniques. As a consequence, the
perfect-feedback capacity can be achieved once the two feedback links are
active with large enough probabilities.Comment: Extended version of the same-titled paper that appears in IEEE
International Symposium on Information Theory (ISIT) 201
- …
