117 research outputs found
Radio frequency mixing modules for superconducting qubit room temperature control systems
As the number of qubits in nascent quantum processing units increases, the
connectorized RF (radio frequency) analog circuits used in first generation
experiments become exceedingly complex. The physical size, cost and electrical
failure rate all become limiting factors in the extensibility of control
systems. We have developed a series of compact RF mixing boards to address this
challenge by integrating I/Q quadrature mixing, IF(intermediate
frequency)/LO(local oscillator)/RF power level adjustments, and DC (direct
current) bias fine tuning on a 40 mm 80 mm 4-layer PCB (printed
circuit board) board with EMI (electromagnetic interference) shielding. The RF
mixing module is designed to work with RF and LO frequencies between 2.5 and
8.5 GHz. The typical image rejection and adjacent channel isolation are
measured to be 27 dBc and 50 dB. By scanning the drive phase in a
loopback test, the module short-term amplitude and phase linearity are
typically measured to be 510
(V/V) and 110 radian (pk-pk).
The operation of RF mixing board was validated by integrating it into the room
temperature control system of a superconducting quantum processor and executing
randomized benchmarking characterization of single and two qubit gates. We
measured a single-qubit process infidelity of and a
two-qubit process infidelity of .Comment: Updated the title. Added the git repository of RF mixing modules
design. Added the explanation for SRB. Added funding agenc
DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors
Camera-based 3D object detectors are welcome due to their wider deployment
and lower price than LiDAR sensors. We revisit the prior stereo modeling DSGN
about the stereo volume constructions for representing both 3D geometry and
semantics. We polish the stereo modeling and propose our approach, DSGN++,
aiming for improving information flow throughout the 2D-to-3D pipeline in the
following three main aspects. First, to effectively lift the 2D information to
stereo volume, we propose depth-wise plane sweeping (DPS) that allows denser
connections and extracts depth-guided features. Second, for better grasping
differently spaced features, we present a novel stereo volume -- Dual-view
Stereo Volume (DSV) that integrates front-view and top-view features and
reconstructs sub-voxel depth in the camera frustum. Third, as the foreground
region becomes less dominant in 3D space, we firstly propose a multi-modal data
editing strategy -- Stereo-LiDAR Copy-Paste, which ensures cross-modal
alignment and improves data efficiency. Without bells and whistles, extensive
experiments in various modality setups on the popular KITTI benchmark show that
our method consistently outperforms other camera-based 3D detectors for all
categories. Code will be released at https://github.com/chenyilun95/DSGN2
CaseGNN: Graph Neural Networks for Legal Case Retrieval with Text-Attributed Graphs
Legal case retrieval is an information retrieval task in the legal domain,
which aims to retrieve relevant cases with a given query case. Recent research
of legal case retrieval mainly relies on traditional bag-of-words models and
language models. Although these methods have achieved significant improvement
in retrieval accuracy, there are still two challenges: (1) Legal structural
information neglect. Previous neural legal case retrieval models mostly encode
the unstructured raw text of case into a case representation, which causes the
lack of important legal structural information in a case and leads to poor case
representation; (2) Lengthy legal text limitation. When using the powerful
BERT-based models, there is a limit of input text lengths, which inevitably
requires to shorten the input via truncation or division with a loss of legal
context information. In this paper, a graph neural networks-based legal case
retrieval model, CaseGNN, is developed to tackle these challenges. To
effectively utilise the legal structural information during encoding, a case is
firstly converted into a Text-Attributed Case Graph (TACG), followed by a
designed Edge Graph Attention Layer and a readout function to obtain the case
graph representation. The CaseGNN model is optimised with a carefully designed
contrastive loss with easy and hard negative sampling. Since the text
attributes in the case graph come from individual sentences, the restriction of
using language models is further avoided without losing the legal context.
Extensive experiments have been conducted on two benchmarks from COLIEE 2022
and COLIEE 2023, which demonstrate that CaseGNN outperforms other
state-of-the-art legal case retrieval methods. The code has been released on
https://github.com/yanran-tang/CaseGNN
DAMO-YOLO : A Report on Real-Time Object Detection Design
In this report, we present a fast and accurate object detection method dubbed
DAMO-YOLO, which achieves higher performance than the state-of-the-art YOLO
series. DAMO-YOLO is extended from YOLO with some new technologies, including
Neural Architecture Search (NAS), efficient Reparameterized Generalized-FPN
(RepGFPN), a lightweight head with AlignedOTA label assignment, and
distillation enhancement. In particular, we use MAE-NAS, a method guided by the
principle of maximum entropy, to search our detection backbone under the
constraints of low latency and high performance, producing ResNet/CSP-like
structures with spatial pyramid pooling and focus modules. In the design of
necks and heads, we follow the rule of ``large neck, small head''.We import
Generalized-FPN with accelerated queen-fusion to build the detector neck and
upgrade its CSPNet with efficient layer aggregation networks (ELAN) and
reparameterization. Then we investigate how detector head size affects
detection performance and find that a heavy neck with only one task projection
layer would yield better results.In addition, AlignedOTA is proposed to solve
the misalignment problem in label assignment. And a distillation schema is
introduced to improve performance to a higher level. Based on these new techs,
we build a suite of models at various scales to meet the needs of different
scenarios. For general industry requirements, we propose DAMO-YOLO-T/S/M/L.
They can achieve 43.6/47.7/50.2/51.9 mAPs on COCO with the latency of
2.78/3.83/5.62/7.95 ms on T4 GPUs respectively. Additionally, for edge devices
with limited computing power, we have also proposed DAMO-YOLO-Ns/Nm/Nl
lightweight models. They can achieve 32.3/38.2/40.5 mAPs on COCO with the
latency of 4.08/5.05/6.69 ms on X86-CPU. Our proposed general and lightweight
models have outperformed other YOLO series models in their respective
application scenarios.Comment: Project Website: https://github.com/tinyvision/damo-yol
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
The rapid advances in Vision Transformer (ViT) refresh the state-of-the-art
performances in various vision tasks, overshadowing the conventional CNN-based
models. This ignites a few recent striking-back research in the CNN world
showing that pure CNN models can achieve as good performance as ViT models when
carefully tuned. While encouraging, designing such high-performance CNN models
is challenging, requiring non-trivial prior knowledge of network design. To
this end, a novel framework termed Mathematical Architecture Design for Deep
CNN (DeepMAD) is proposed to design high-performance CNN models in a principled
way. In DeepMAD, a CNN network is modeled as an information processing system
whose expressiveness and effectiveness can be analytically formulated by their
structural parameters. Then a constrained mathematical programming (MP) problem
is proposed to optimize these structural parameters. The MP problem can be
easily solved by off-the-shelf MP solvers on CPUs with a small memory
footprint. In addition, DeepMAD is a pure mathematical framework: no GPU or
training data is required during network design. The superiority of DeepMAD is
validated on multiple large-scale computer vision benchmark datasets. Notably
on ImageNet-1k, only using conventional convolutional layers, DeepMAD achieves
0.7% and 1.5% higher top-1 accuracy than ConvNeXt and Swin on Tiny level, and
0.8% and 0.9% higher on Small level.Comment: Accepted by CVPR 202
Enhancing Model Performance in Multilingual Information Retrieval with Comprehensive Data Engineering Techniques
In this paper, we present our solution to the Multilingual Information
Retrieval Across a Continuum of Languages (MIRACL) challenge of WSDM CUP
2023\footnote{https://project-miracl.github.io/}. Our solution focuses on
enhancing the ranking stage, where we fine-tune pre-trained multilingual
transformer-based models with MIRACL dataset. Our model improvement is mainly
achieved through diverse data engineering techniques, including the collection
of additional relevant training data, data augmentation, and negative sampling.
Our fine-tuned model effectively determines the semantic relevance between
queries and documents, resulting in a significant improvement in the efficiency
of the multilingual information retrieval process. Finally, Our team is pleased
to achieve remarkable results in this challenging competition, securing 2nd
place in the Surprise-Languages track with a score of 0.835 and 3rd place in
the Known-Languages track with an average nDCG@10 score of 0.716 across the 16
known languages on the final leaderboard
Efficient Generation of Multi-partite Entanglement between Non-local Superconducting Qubits using Classical Feedback
Quantum entanglement is one of the primary features which distinguishes
quantum computers from classical computers. In gate-based quantum computing,
the creation of entangled states or the distribution of entanglement across a
quantum processor often requires circuit depths which grow with the number of
entangled qubits. However, in teleportation-based quantum computing, one can
deterministically generate entangled states with a circuit depth that is
constant in the number of qubits, provided that one has access to an entangled
resource state, the ability to perform mid-circuit measurements, and can
rapidly transmit classical information. In this work, aided by fast classical
FPGA-based control hardware with a feedback latency of only 150 ns, we explore
the utility of teleportation-based protocols for generating non-local,
multi-partite entanglement between superconducting qubits. First, we
demonstrate well-known protocols for generating Greenberger-Horne-Zeilinger
(GHZ) states and non-local CNOT gates in constant depth. Next, we utilize both
protocols for implementing an unbounded fan-out (i.e., controlled-NOT-NOT) gate
in constant depth between three non-local qubits. Finally, we demonstrate
deterministic state teleportation and entanglement swapping between qubits on
opposite side of our quantum processor
- …