Loading...

Table of Content

15 April 2025 Volume 38 Issue 4
  
    Outdoor Garbage Detection and Recognition Based on Dual Branch Networks
    ZHAO Wenqi, ZHANG Lixin, KAN Xi, ZHENG Haoren
    Electronic Science and Technology. 2025, 38(4):  1-9.  doi:10.16180/j.cnki.issn1007-7820.2025.04.001
    Abstract ( 439 )   HTML ( 69 )   PDF (7102KB) ( 149 )  
    Figures and Tables | References | Related Articles | Metrics

    The existing outdoor waste detection algorithms do not fully consider the advantages and disadvantages of CNN (Convolutional Neural Network) and Transformer in feature extraction, which limits the overall performance of the network. This study proposes a two-branch fusion network detection algorithm composed of CNN and Transformer. In the coding stage, a two-branch backbone network is constructed based on the advantages of CNN and Transformer to extract the feature information of the original image. Multi-scale convolutional module and multi-scale pooling module are used to eliminate the differences in dimension and semantics of extracted feature information, and the loss of detail information in deep neural network is reduced by strengthening feature extraction network. Six types of outdoor garbage images are collected, and a data set of garbage images with complex background is built to verify the performance of the proposed algorithm in outdoor garbage detection and recognition task. The experimental results show that the mAP(mean Average Precision) of the proposed algorithm on this data set is improved by about 5% when compared with the latest target detection algorithm. In order to verify the generalization performance of the proposed algorithm, a generalization experiment is carried out on the Huawei garbage classification challenge cup data set, and the experimental results show that mAP of the proposed algorithm is improved by about 2% when compared with the latest object detection algorithm.

    High-Frequency Pulse Discharge Characteristics in Saline Solution
    DONG Shanlin, LI Zi
    Electronic Science and Technology. 2025, 38(4):  10-15.  doi:10.16180/j.cnki.issn1007-7820.2025.04.002
    Abstract ( 355 )   HTML ( 13 )   PDF (2094KB) ( 76 )  
    Figures and Tables | References | Related Articles | Metrics

    In view of the characteristics of high frequency pulse discharge in saline solution, a pin-needle reactor structure is designed to study the discharge characteristics of pin-needle discharge under pulse voltage. The different electrical characteristics and experimental characteristics are tested and analyzed, and the relationship between impedance and voltage amplitude from no discharge to discharge is measured. It is proved that the change of steam layer can affect discharge. In the plasma emission spectrum of discharge, the characteristic emission band of OH free radical in the band of 306~312 nm affects the biological tissue excision. The finite element multi-physics model of the discharge is established. The results show that the thickness of the vapor layer is about 100 μm and the electron density of the plasma is about 1016 cm-3.

    Multi-Class Defect Target Detection for Transmission Lines Based on Improved YOLOv7
    BI Hanjia, YANG Churui, WANG Xiaoyu, HUANG Yuehua
    Electronic Science and Technology. 2025, 38(4):  16-24.  doi:10.16180/j.cnki.issn1007-7820.2025.04.003
    Abstract ( 490 )   HTML ( 18 )   PDF (3996KB) ( 100 )  
    Figures and Tables | References | Related Articles | Metrics

    In view of the low detection accuracy of multi-scale defect targets in transmission lines under complex background, an improved YOLOv7(You Only Look Once version7) defect target detection model for transmission lines is proposed. To solve the problem of low defect targets caused by complex background, an improved Swin Transformer module is introduced in the Backbone part to improve the detection accuracy of the model using multi-head attention mechanism to improve the effect of global feature extraction. According to the multi-scale characteristics of the target to be detected, an adaptive feature fusion module is introduced on the basis of the feature pyramid to improve the detection ability of the Neck partial feature fusion network on multiple defect targets of different scales. SIoU(Structured Intersection over Union) loss function is used to improve the accuracy of prediction frame regression and accelerate the model convergence. Experimental results show that compared with YOLOv5, YOLOv7 and Faster R-CNN(Faster Region Proposal Convolutional Neural Network) models, the improved YOLOv7 model has higher detection accuracy, with an average detection accuracy of 96.4% and a detection speed of 29.6 frame∙s-1, which can provide reference for the detection of multiple types of defect targets of transmission lines.

    Seismic Data Denoising Method Based on CycleGAN
    FU Peng, SONG Xiaoxia
    Electronic Science and Technology. 2025, 38(4):  25-30.  doi:10.16180/j.cnki.issn1007-7820.2025.04.004
    Abstract ( 448 )   HTML ( 11 )   PDF (2052KB) ( 57 )  
    Figures and Tables | References | Related Articles | Metrics

    In view of the problem that the actual seismic data is interfered by a large amount of random noise and it is difficult to obtain paired noise-free data, this study proposes a random noise suppression method of seismic data based on CycleGAN(Cycle Generative Adversarial Network) to obtain high-quality seismic data. The residual network is introduced into the generative network of cyclic generative adversarial network, and the training speed of the network is accelerated by jumping connection, and the convolution layer in the residual block is expanded and the structure of the residual block is enhanced to obtain the sample features better. Experiments are conducted with synthetic data and actual data respectively, and evaluation indexes such as SNR(Signal to Noise Ratio) and MSE(Mean Square Error) are used to verify the denoising effect. The results show that compared with CNN, the SNR, MSE and PSNR(Peak Signal-to-Noise Ratio) of the proposed method increased by 0.59 dB, 23.72 and 2.81 dB respectively, in the synthetic data experiment. In the actual data experiment, the increase is 4.63 dB, 1.13 and 0.77 dB, respectively, and the training time is reduced by about 58%.

    DEVAE-GAN:A Generative Model of the fNIRS Data under Multiple Levels of the Cognitive Workload
    CHEN Li, MA Zhuang, YIN Zhong
    Electronic Science and Technology. 2025, 38(4):  31-38.  doi:10.16180/j.cnki.issn1007-7820.2025.04.005
    Abstract ( 383 )   HTML ( 5 )   PDF (2279KB) ( 23 )  
    Figures and Tables | References | Related Articles | Metrics

    The application of deep learning methods to fNIRS(functional Near-Infrared Spectroscopy) has become a research hotspot in the field of brain computer interface, but less available data limits the performance of deep learning models. A method for generating fNIRS original signal is proposed based on DEVAE-GAN(Dual-Encoder-Variational Autoencoder-Generative Adversarial Network) in this study. In this method, the pre-processed fNIRS signals are converted into time and space representations, input into a dual encoder to extract time and space information, splice two pieces of information, and send to the decoder to generate samples. In order to verify its effectiveness, experiments are conducted on public data sets of mental load tasks, and different numbers of generated samples are extended to the training data set, and the enhanced data set is used to train the deep neural network. Compared with multiple baseline generation models, the proposed method generates the highest sample quality, and the average classification accuracy of all subjects after using this method is 95.86%, which is increased by 0.91% when compared with the original data set. The experimental results show that the proposed method can effectively learn the distribution of raw data of mental load task fNIRS, generate high-quality samples, and improve the performance of deep learning models.

    The User Fall Detection Algorithm for Walking-Aid Robot Based on Machine Learning
    HU Jiawei, WANG Yagang
    Electronic Science and Technology. 2025, 38(4):  39-45.  doi:10.16180/j.cnki.issn1007-7820.2025.04.006
    Abstract ( 368 )   HTML ( 11 )   PDF (2348KB) ( 44 )  
    Figures and Tables | References | Related Articles | Metrics

    In order to prevent users from falling and improve the safety of walking robot, a fall detection algorithm of walking robot based on improved support vector machine model is proposed. The displacement and velocity data sets are obtained by the pressure sensor and ultrasonic ranging sensor installed on the walking robot, and the user's motion intention is obtained by data fusion using improved Kalman filter, and the acceleration data features are extracted. Based on the improved support vector machine model, the output probability and risk function optimization are carried out to improve the accuracy of classification threshold. The simulation results show that the accuracy of the proposed algorithm is 98.68%, the sensitivity and specificity are 96.66% and 90.34%, respectively. The proposed algorithm has a good comprehensive performance index and can trigger the defense protection mechanism within 500 ms.

    Brain Tumor Classification Algorithm Based on Transfer Learning and Improved EfficientNet-B0
    WANG Yong, YANG Yilong, FAN Xiaohui, ZHOU Lei, KONG Xiangyong
    Electronic Science and Technology. 2025, 38(4):  46-51.  doi:10.16180/j.cnki.issn1007-7820.2025.04.007
    Abstract ( 452 )   HTML ( 10 )   PDF (2148KB) ( 50 )  
    Figures and Tables | References | Related Articles | Metrics

    In view of the problems of high complexity and low recognition rate of existing brain tumor classification models and methods, an model based on improved EfficientNet-B0 is proposed for the classification of three brain tumors. In the data preprocessing stage, the ROI (Region of Interest) feature is used to cut out the key feature regions of brain tumor images, and the data set is expanded according to tumor type. The MBConv(Mobile Inverted Bottleneck Convolution) module in EfficientNet is redesigned according to the convolutional network design idea, and the CBAM(Convolutional Block Attention Module) is introduced after the first convolution step. In order to carry out transfer learning more completely, three neurons are attached to the brain tumors without modifying the original output structure. The improved network model has lower complexity and better adapts to the identification of tumor lesions. The transfer learning method is used to fine-tune the public data set figshare-Brain Tumor Dataset. Experimental results show that the improved model achieves a classification accuracy of 99.67% on the public data set, which is about 3.1 percentage points higher than the original EfficientNet-B0 network.

    Research on Urban Waterlogging Simulation and Monitoring Platform Based on Container Service
    LI Linjie, LIU Deer
    Electronic Science and Technology. 2025, 38(4):  52-58.  doi:10.16180/j.cnki.issn1007-7820.2025.04.008
    Abstract ( 293 )   HTML ( 4 )   PDF (4125KB) ( 29 )  
    Figures and Tables | References | Related Articles | Metrics

    In view of the problem that the urban waterlogging can't perceive the flooding situation, a simulation and monitoring platform for urban waterlogging is proposed. Through this platform, the information of waterlogging points, the safety of vehicle traffic and the dynamic process of simulating flooding can be known in real time. According to the decoupling idea, the platform is divided into drainage pipe network module, waterlogging simulation module, video fusion module and vehicle safety identification module, which communicate with each other through API(Application Programming Interface) interface. Back-end builds API interface through Python's Web application framework, and Cesium front-end calls API interface to show the effect. In order to facilitate the service expansion and dynamic adjustment in the later stage of the platform, after the platform is developed, all functional modules of the platform are packaged into images by Docker and uploaded to the cloud warehouse. The image service is run through the server cluster and the cluster is managed by Rancher to achieve the containerized deployment effect.

    Research on PID Parameter Tuning of Switch Reluctance Motor Based on Whale Optimization Algorithm
    WEI Runtong, WANG Zhichong, LI Xueqi
    Electronic Science and Technology. 2025, 38(4):  59-65.  doi:10.16180/j.cnki.issn1007-7820.2025.04.009
    Abstract ( 317 )   HTML ( 7 )   PDF (3942KB) ( 38 )  
    Figures and Tables | References | Related Articles | Metrics

    In view of the problems of poor stability and difficult parameter adjustment in switched reluctance motor PID(Proportional Integral Derivative) controller, the whale optimization algorithm is introduced based on traditional switched reluctance motor controller, and improved integral time absolute error function is used as fitness function to adjust three control parameters of Kp, Ki and Kd.A parameter tuning system of three-phase 6/4-pole switched reluctance motor is built on MATLAB/Simulink simulation platform. The effect of traditional empirical parameter tuning algorithm is analyzed and compared with the results of particle swarm optimization, genetic algorithm and gray wolf optimization. The simulation results show that the PID parameters obtained by the proposed method are more accurate and its effect is better than the three comparison algorithms.Compared with the empirical method, the response speed of the whale algorithm is increased by 51.10%, and the error is reduced by 0.67%, which makes the speed control system have faster and more stable response characteristics.

    Design of Audio Acquisition System for Acoustic Camera Based on FPGA
    WEN Jinlong, SHI Wei, HU Dingyu, LIAO Aihua
    Electronic Science and Technology. 2025, 38(4):  66-72.  doi:10.16180/j.cnki.issn1007-7820.2025.04.010
    Abstract ( 504 )   HTML ( 10 )   PDF (10982KB) ( 90 )  
    Figures and Tables | References | Related Articles | Metrics

    In view of the low signal acquisition efficiency and high hardware logic complexity of the acoustic camera audio acquisition scheme based on FPGA(Field Programmable Gate Array), an efficient acoustic camera audio acquisition system design scheme based on FPGA is proposed in this study. Inside the FPGA, a cascaded and efficient decoding filter is utilized to decode the multi-channel PDM(Pulse Density Modulation) audio signal stream to reduce the workload of hardware calculation. FPGA collects the PDM data stream of multi-channel MEMS(Micro Electro Mechanical Systems) microphones for bit splicing and complementing into signed numbers. Each frame data packet is pipelinized in each cascade filter through AXI4-Stream bus protocol. The entire decoding process runs in parallel to simplify hardware logic complexity. The decoded data is transmitted over Gigabit Ethernet using UDP(User Datagram Protocol) to ensure real-time transmission. Simulation and experimental results verify the feasibility and effectiveness of the proposed scheme. The data decoding delay is less than 200 μs, the hardware logic resource occupancy is less than 60%, the data packet loss rate is 0%, which demonstrate that the proposed method has high real-time performance and strong stability and can provide more beneficial enlightenment for the design of FPGA-based acoustic camera.

    Transformer Fault Identification Method Based on Gramian Angle Difference Field and CNN-BiGRU
    XU Yaobo, YANG Xinqiang, XU Guangchao, YANG Shihao, DUAN Guoyong
    Electronic Science and Technology. 2025, 38(4):  73-79.  doi:10.16180/j.cnki.issn1007-7820.2025.04.011
    Abstract ( 406 )   HTML ( 7 )   PDF (2496KB) ( 38 )  
    Figures and Tables | References | Related Articles | Metrics

    In view of the problems such as the difficulty in extracting the fault characteristics of transformer windings and the relatively low diagnostic accuracy, this study proposes a transformer fault identification method based on the GADF(Gramian Angular Difference Field) and the CNN-BiGRU(Convolutional Neural Network-Bidirectional Gated Recurrent Unit) on the basis of the frequency response curve. In response to the problem that the original features have a small discriminative ability for different fault types, a moving window calculation method is proposed to process the sample segments. By combining with the Gram angular difference field transformation, the spectral features are obtained, realizing the mapping of one-dimensional data into three-dimensional image data. The distribution characteristics of different fault types in the spectral features are analyzed. Taking the obtained spectral features as the input, the fault segment data are classified through the recurrent convolutional neural network to obtain the identification results. Compared with the traditional methods, the proposed method has more obvious feature differences, and the accuracy is further improved. The simulation results show that the classification accuracy of the slices reaches 96.2%, and the high accuracy of the diagnostic results verifies the feasibility of this method.

    Self-Activated Learning Method for Weakly Supervised Semantic Segmentation Integrating Attention Mechanism
    ZHOU Kai, YU Lianzhi
    Electronic Science and Technology. 2025, 38(4):  80-86.  doi:10.16180/j.cnki.issn1007-7820.2025.04.012
    Abstract ( 326 )   HTML ( 7 )   PDF (1641KB) ( 41 )  
    Figures and Tables | References | Related Articles | Metrics

    Weakly supervised semantic segmentation is typically trained by class activation maps, but there is a significant gap between class activation maps and real pixel-level labels. A self-activation model of weakly supervised semantic segmentation based on attention mechanism is proposed to solve the problem that class activation maps of weakly supervised semantic segmentation have little positioning information and rough contour of segmentation results. The implicit constraints in the full supervision method are introduced by affine changes, the shallow information of the classification network is extracted and the attention mechanism is integrated. The enhanced shallow information is used to refine the outline of the class activation diagram, and the feature diagram is self-activated according to the generated class activation diagram, so as to generate the final class activation diagram. Experiments on the PASCAL VOC 2012 data set show a 1.7% improvement in the average crossover ratio of class activation graphs and a 2.4% improvement in the average crossover ratio of final segmentation results compared to recent advanced models. The effectiveness of each module is verified by the ablation experiments.

Download

Monthly,Founded in September 1987
Competent Authorities:
Ministry of Education of the People's Republic of China
Sponsored by:Xidian University
Chief Editor:Liao Guisheng
Executive Editor:Wan Liancheng
Editor:Hei Lei
Editor and Publisher:
The Editorial Department of Electronic Science and Technology
Distribution Abroad:
China Intermational Book Trading Corporation
P.O.BOx 399,Beijing 100044,China
Address:
P.O.Box 375,2 Taibai Road(South),Xi'an 710071,China
Tel/Fax:0086-029-88202440
Website:http://www.dianzikeji.org
E-mail:dzkj@mail.xidian.edu.cn
Unit Price:$20.00