15,493 research outputs found

    Effective Blog Pages Extractor for Better UGC Accessing

    Full text link
    Blog is becoming an increasingly popular media for information publishing. Besides the main content, most of blog pages nowadays also contain noisy information such as advertisements etc. Removing these unrelated elements can improves user experience, but also can better adapt the content to various devices such as mobile phones. Though template-based extractors are highly accurate, they may incur expensive cost in that a large number of template need to be developed and they will fail once the template is updated. To address these issues, we present a novel template-independent content extractor for blog pages. First, we convert a blog page into a DOM-Tree, where all elements including the title and body blocks in a page correspond to subtrees. Then we construct subtree candidate set for the title and the body blocks respectively, and extract both spatial and content features for elements contained in the subtree. SVM classifiers for the title and the body blocks are trained using these features. Finally, the classifiers are used to extract the main content from blog pages. We test our extractor on 2,250 blog pages crawled from nine blog sites with obviously different styles and templates. Experimental results verify the effectiveness of our extractor.Comment: 2016 3rd International Conference on Information Science and Control Engineering (ICISCE

    The Mechanisms of Electron Acceleration During Multiple X Line Magnetic Reconnection with a Guide Field

    Full text link
    The interactions between magnetic islands are considered to play an important role in electron acceleration during magnetic reconnection. In this paper, two-dimensional (2-D) particle-in-cell (PIC) simulations are performed to study electron acceleration during multiple X line reconnection with a guide field. The electrons remain almost magnetized, and we can then analyze the contributions of the parallel electric field, Fermi and betatron mechanisms to electron acceleration during the evolution of magnetic reconnection by comparing with a guide-center theory. The results show that with the proceeding of magnetic reconnection, two magnetic islands are formed in the simulation domain. The electrons are accelerated by both the parallel electric field in the vicinity of the X lines and Fermi mechanism due to the contraction of the two magnetic islands. Then the two magnetic islands begin to merge into one, and in such a process electrons can be accelerated by the parallel electric field and betatron mechanisms. During the betatron acceleration, the electrons are locally accelerated in the regions where the magnetic field is piled up by the high-speed flow from the X line. At last, when the coalescence of the two islands into a big one finishes, electrons can further be accelerated by the Fermi mechanism because of the contraction of the big island. With the increase of the guide field, the contributions of Fermi and betatron mechanisms to electron acceleration become less and less important. When the guide field is sufficiently large, the contributions of Fermi and betatron mechanisms are almost negligible.Comment: 22 pages, 5 figures, accepted by The Astrophysical Journa

    Interference Channel with Intermittent Feedback

    Full text link
    We investigate how to exploit intermittent feedback for interference management. Focusing on the two-user linear deterministic interference channel, we completely characterize the capacity region. We find that the characterization only depends on the forward channel parameters and the marginal probability distribution of each feedback link. The scheme we propose makes use of block Markov encoding and quantize-map-and-forward at the transmitters, and backward decoding at the receivers. Matching outer bounds are derived based on novel genie-aided techniques. As a consequence, the perfect-feedback capacity can be achieved once the two feedback links are active with large enough probabilities.Comment: Extended version of the same-titled paper that appears in IEEE International Symposium on Information Theory (ISIT) 201
    corecore