Mobile QR Code QR CODE

2024

Acceptance Ratio

21%


  1. (School of Design Art, Changsha University of Science & Technology, Changsha, 410114, China)



Q-learning algorithm, AI, Relief craft, 3D printing technology, Path planning

1. Introduction

As a typical representative of refined and personalized production in modern manufacturing, the three-dimensional (3D) printing of relief technology is gradually maturing. Faced with the continuous precision control and optimization problems during the printing process, the quality and aesthetics of products are affected [1,2]. In addition, with the development of materials science, new materials have brought more possibilities to 3D printing technology, which also raised higher requirements for evaluating the accuracy and generalization ability of models [3,4]. In the existing technology, the precision control and optimization problems of relief 3D printing technology are constantly emerging, which affects product quality and aesthetics. Moreover, the evaluation model of printing technology is difficult to achieve optimal printing quality when facing changing printing conditions and complex geometric structures. In addition, existing evaluation models for relief 3D printing technology often rely on empirical judgment or fixed parameter settings, lacking adaptability and dynamic optimization capabilities for complex printing tasks. The Q-learning algorithm is a decision algorithm suitable for both non-deterministic and dynamic environments. Its application in parameter optimization of relief 3D printing is expected to play a crucial role. By learning and adjusting printing strategies in real-time, Q-learning algorithms can self learn and adapt, continuously optimizing parameters during the printing process to optimize printing quality. This algorithm helps to solve the challenges in existing technologies, providing new ideas for the practical application of artificial intelligence (AI) in non-standard production processes [5,6]. Therefore, the study proposes an evaluation model that can learn and adjust printing strategies in real-time. The Q-learning algorithm learns the optimal behavior strategy through interaction with the environment. It is expected to play a key role in optimizing the parameters of relief 3D printing. Through this approach, the model can continuously learn during the printing process, automatically adjust parameters based on the printing results, and optimize printing quality.

The research is composed of four parts. The first part introduces the background of the combination of relief technology and 3D printing technology, explores its application in artistic creation and cultural inheritance, as well as the existing technical challenges. The second part reviews the current application status of AI in 3D printing technology, especially the research progress in relief artworks. The third part elaborates on the design of the AI model for path planning based on Q-learning algorithm. The fourth part analyzes the effectiveness of Q-learning algorithm in the evaluation model of 3D printing technology in relief craft, and explores its application effect in actual artistic creation. The fifth part summarizes the research outcomes of the entire article and points out the potential improvement space of the model. The terminology names involved in the research are shown in Table 1.

Table 1. Nomenclature of terms.

Abbreviation

Full name

3D

Three-dimensional

AI

Artificial Intelligence

AFM

Atomic Force Microscope

PSO

Particle Swarm Optimization

AI-AHP

AI-Analytic Hierarchy Process

IHSA

Improved Heuristic-based Search Algorithm

ACOPP

Ant Colony Optimization for Path Planning

GA

Genetic Algorithm

2. Related Works

With the widespread application of 3D printing technology in the fields of art restoration and cultural relic protection, the evaluation model of relief 3D printing technology that integrates Q-learning algorithm is receiving high attention from researchers. Li J et al. proposed an algorithm for full-color 3D printing by preparing water-based binders and polymeric powders to solve nozzle clogging and dimensional accuracy problems. The results showed that the optimized binder powder system solved nozzle blockage, resulting in a size error of less than 2.5%, and verifying the applicability of the halftone algorithm. The halftone algorithm used provides a feasible solution for the implementation of full-color 3D printing [7]. Yang H et al. deeply analyzed the impact of structural parameters on the performance of 3D printed super-capacitor electrodes based on graphene and carbon nanotubes through machine learning. The results showed that based on the constructed model, selecting structural parameters directly to print performance electrodes achieved precise customization of performance, with a performance adjustment range from 0.032 to 1.6 F cm$^{-2}$. The electrochemical active area and finite element analysis confirmed the dependence of surface performance on structural parameters, which was highly consistent with the model information [8]. Gruber D M et al. implemented a simple Atomic Force Microscope (AFM) model using 3D printing of commercial thermoplastic materials. The results indicated that the model included key components of AFM, such as z-axis platform, AFM head and cantilever beam assembly, laser, etc. The torque curves with protrusion, pull-out, and contact zone characteristics were collected through magnets and metal samples at the top of the cantilever beam [9]. Singhal et al. proposed an open-source basic chemical modeling module file for representing chemical formulas, chemical equations, and ionic states. The results indicated that these 3D printing modules could help visually impaired students understand and represent element and molecular reactions. Student feedback indicated that these scalable and Braille labeled models were very helpful for them to learn chemical formulas and ion valence states [10]. Shen H et al. proposed a flexible 3D printing support platform that could form an external support structure to replace the original support. The results indicated that for the cantilever structure model, platform optimization could minimize support while ensuring surface quality requirements. When there was no surface quality requirement, the improved algorithm could minimize the support of each model. Considering support and printing time, this algorithm was easy to obtain optimization results to guide practice. This method optimized the printing direction of the Particle swarm optimization (PSO) model on the platform, reduced material costs, and improved efficiency [11].

This evaluation model aims to improve the quality of relief printing while ensuring the accuracy and efficiency of the process through intelligent learning and data analysis. Gallegos M A et al. proposed an engineering platform for printing electronics. This platform utilized microscale additive manufacturing technology to design and produce microstructure materials with controllable pores and deformation as printing plates, achieving controlled ink transfer of submicron functional films. The results indicated that the degrees of freedom and high-resolution 3D control provided by this method supported the research of fluid transfer mechanisms, which is beneficial for the development of advanced manufacturing processes [12]. Vorzobova N D et al. proposed the formation conditions, diffraction, and selective properties of holographic gratings in various types of photopolymer materials. From the results, by changing the characteristics and orientation of the grating, high diffraction performance could be achieved over a wide range of incident angles in 3D space, expanding its angle range without the need for stacking optical components. The effective conditions for the formation of hybrid structures combining the performance of bulk gratings and relief gratings were determined, indicating that the relief part of the hybrid structure could expand the range of incident angles [13]. Maillard M et al. utilized slurry direct writing technology to prepare dense single/multi material ceramic components by controlling the needle to extrude ceramic slurry micro-filaments in 3D space. The results indicated that the rheological properties, printing parameters, and post-treatment heat treatment of ceramic slurry were the key to controlling the quality of printed components. The work characterized the rheological properties of alumina slurry and established their relationship with printing conditions, extrudability, shape fidelity, and final component mechanical properties [14]. Elizarova I S et al. proposed a new ceramic slurry that could still shape the green body after printing, solving the limited ability to print complex suspension structures using additive manufacturing ceramic continuous extrusion method. The results showed that after printing, the green body could be cut, bent, folded, formed around the object, and could be woven after drying. After slow pyrolysis and sintering, the mechanical properties were equivalent to those of traditional methods, and post press shaping did not affect the mechanical properties of the sintered body. This technology greatly improved the printability of complex structures [15]. On the basis of determining the optimal sintering temperature and time combination, \'{A} lvarez F et al. prepared different ink materials with a ceramic content of 75wt% (weight, ceramic content accounts for 75% of total body weight). The microstructure and mechanical properties of the sintered body were systematically studied, and the optimal ceramic content was determined. Finally, finite element simulation and catalytic experiments on the selected ceramic ink materials denoted that a 40% volume fraction linear filling mode 3D printing component was beneficial for improving catalytic performance. This process achieved controllable preparation of $\alpha$-Al2O3 complex structures [16].

In summary, the current 3D printing technology evaluation model for relief craft shows significant shortcomings in dealing with the variability of printing conditions and material characteristics and diversity. Conventional models are difficult to adapt to complex task requirements, resulting in printing quality not meeting the requirements. To address this issue, this research proposes an intelligent evaluation model that integrates Q-learning algorithm, utilizing its powerful learning and adaptability to achieve dynamic optimization and control of printing process parameters. This study aims to provide more efficient and accurate production evaluation tools for the 3D printing field of relief technology.

3. The Evaluation Model of 3D Printing Technology for Relief Craft Integrating Q-Learning Algorithm and AI

This research utilizes an evaluation index system enhanced by AI, specifically targeting the application of relief craft in 3D printing technology. The indicator system is combined with Q-learning algorithm for path planning optimization. Then an efficient 3D printing technology evaluation model is constructed. The model uses Q-learning algorithm to optimize the 3D printing path.

3.1 Design of AI Driven Evaluation Model For 3D Printing Technology in Relief Craft

This section first constructs an evaluation index system for 3D printing technology in relief technology, including accuracy, speed, product quality, and AI integration level. Then, the Q-learning algorithm and swarm intelligence are used to optimize the judgment matrix and dynamically adjust the weights. Finally, a comprehensive evaluation is conducted by combining Q-learning-based combination weight algorithm and ideal point method. The proposed model can quantitatively evaluate the accuracy, efficiency, and quality of 3D printing technology in relief technology.

Faced with the quality control challenges and efficiency issues in the development of 3D printing technology for relief craft, as well as the increasing demand for personalized products, it is particularly important to build a scientific and reasonable evaluation index system for 3D printing technology for relief craft [17,18]. Therefore, this study starts from the entire process of 3D printing technology and constructs a set of evaluation index system for 3D printing technology of relief craft. This index system has six characteristics: system integrity, layering, relative independence, guidance, practicality, and comparability. Considering the current situation of 3D printing technology in relief craft, 11 representative and covering evaluation indicators are selected from three dimensions: design accuracy, printing speed, and product quality. Through this indicator system, the application level of 3D printing technology in relief technology can be effectively evaluated, as indicated in Table 2.

Table 2. Evaluation index system for the application of 3D printing technology in relief craft.

Primary indicators

Secondary indicators

Development Level of Relief 3D Printing Tech

K

Proportion of Precision in Relief Printing

k1

3D Printing Speed

k2

Improvement Rate of 3D Printing Precision

k3

Optimization of Relief Printing Path

k4

Quality Level of Relief 3D Printed Products

S

Rate of Qualified Finished Products

s1

Surface Smoothness

s2

Material Utilization Rate

s3

Level of AI Integration in Relief 3D Printing

F

AI Optimization Ratio

f1

Q-learning Algorithm Optimization Effect

f2

Automated Printing Ratio

f3

Intelligent Path Planning Completion Degree

f4

In Table 2, the development level indicators of relief 3D printing technology include relief printing accuracy ratio (k1), 3D printing speed (k2), printing accuracy improvement rate (k3), and relief printing path optimization degree (k4). In terms of product quality, evaluation indicators include qualified yield (s1), surface smoothness (s2), and material utilization rate (s3). Regarding the AI integration level, indicators include AI optimization ratio (f1), Q-learning algorithm optimization effect (f2), automated printing ratio (f3), and intelligent path planning completion degree (f4). These indicators possess system integrity, layering, and directionality, which can effectively evaluate the practical application level of 3D printing technology in relief technology.

To efficiently estimate the application progress of 3D printing technology in relief craft, an AI evaluation model based on Q-learning algorithm optimization is designed. This model combines AI driven learning mechanisms and path planning algorithms, adjusts subjective and objective parameters, and combines traditional evaluation methods to assign weights and comprehensively analyze selected evaluation indicators, achieving quantitative and comprehensive evaluation of the application level of 3D printing technology in relief craft. The hierarchical analysis structure is shown in Fig. 1.

Fig. 1. Analytic hierarchy process.

../../Resources/ieie/IEIESPC.2025.14.2.242/image1.png

In Fig. 1, the AI analytic hierarchy process (AI-AHP) based on Q-learning algorithm is utilized to calculate the subjective weights of the evaluation indicators for 3D printing technology in relief craft. The specific steps include building a hierarchical structure model, and establishing a judgment matrix, using Q-learning algorithm to optimize weights and performing consistency testing to obtain the total weight. As a global optimization method, Q-learning algorithm can optimize the judgment matrix, avoid possible inconsistency issues, and improve the accuracy of evaluation index weight calculation [19,20]. The judgment matrix is shown in Eq. (1).

(1)
$ A=\left[\!\!\begin{array}{ccc} {a_{11} } & {\cdots } & {a_{1n} } \\ {\vdots } & {\vdots } & {\vdots } \\ {a_{n1} } & {\cdots } & {a_{nn} } \end{array}\!\!\right] . $

In Eq. (1), $n$ means the amount of evaluation indicators and $a_{nn} $ denotes the comparison of importance between indicators. The relative weight of elements under consistency testing is shown in Eq. (2).

(2)
$ w_{j} =\frac{\left(\prod _{j=1}^{n}a_{ij} \right)^{\frac{1}{n} } }{\sum _{i=1}^{n}\left(\prod _{j=1}^{n}a_{ij} \right)^{\frac{1}{n} } } . $

To optimize the consistency of the evaluation process, Q-learning is used to adjust the judgment matrix. The consistency check formula is shown in Eq. (3).

(3)
$ CR=\frac{\lambda _{\max } -n}{n-1 / RI }. $

In Eq. (3), $CR$ means the consistency ratio coefficient, $\lambda _{\max } $ refers to the maximum eigenvalue, and $RI$ expresses the random consistency index. The introduced Q-learning algorithm improves the consistency of judgment matrix through learning and iterative optimization, thereby enhancing the accuracy and efficiency of 3D printing technology evaluation in relief craft. The timing optimization and path planning functions of AI further enhance the dynamic adaptability of the evaluation model, making it more in line with the application requirements of 3D printing technology in relief craft. The consistency index values are indicated in Table 3.

Table 3. Consistency index value.

The order of judgment matrix

Random consistency index

Data 1

Data 2

1

0.00

0.10

0.20

2

0.00

0.20

0.30

3

0.52

0.30

0.40

4

0.89

0.40

0.50

5

1.11

0.50

0.60

6

1.25

0.60

0.70

7

1.35

0.70

0.80

8

1.4

0.80

0.90

9

1.45

0.90

1.00

10

1.49

1.00

1.10

In Table 3, Data 1 and Data 2 represent the consistency indices obtained by introducing Q-learning algorithm optimization under different judgment matrix orders. Among them, Data 1 is the consistency index of Q-learning initial parameter optimization, mainly referring to the consistency index value of the optimized judgment matrix under the initial conditions or specific parameter settings of the Q-learning algorithm. Data 2 shows the consistency index values of the optimized judgment matrix for different Q-learning parameters under another set of parameter settings in the Q-learning algorithm. On this basis, a nonlinear model can be obtained from the consistency judgment matrix and group information, as shown in Eq. (4).

(4)
$\begin{align} \left\{\begin{aligned} & \min H\left(w_{j} \right)=\frac{\sum _{i=1}^{n}\left|\sum _{j=1}^{n}\left(a_{ij} w_{j} \right)-nw_{i} \right| }{n} , \\ & \text{s.t.}~\sum _{j=1}^{n}w_{j} =1,~0\le w_{j} \le 1,~j=1,~2,~\ldots ,~n . \end{aligned}\right.\end{align} $

Q-learning algorithm and swarm intelligence are used to construct a nonlinear model, optimize the judgment matrix of 3D printing relief craft evaluation indicators, and dynamically adjust weights. The QLEM method utilizes information entropy to quantitatively evaluate indicator weights, and Q-learning further optimizes these weights in a temporal manner, improving the objectivity and adaptability of the evaluation. The key steps include establishing a standardized indicator matrix, calculating feature weights, obtaining entropy weights, and finally calculating the coefficient of difference. The indicator weight is shown in Eq. (5).

(5)
$ q_{j} =\frac{1-k_{j} }{\sum _{j=1}^{n}\left(1-k_{j} \right) }. $

In Eq. (5), $k_{j} $ represents the entropy weight. By combining the temporal optimization ability in AI technology, dynamic adjustment of indicator weights can be achieved.

After calculating the subjective and objective weights, a combination weight algorithm based on Q-learning is used to integrate the two weights and obtain the comprehensive weight of a single indicator. The goal of this combination weight algorithm is to find a weight combination that minimizes the information difference between subjective and objective weights while maintaining the dynamic adaptability of the evaluation model, to make the evaluation results more accurate and reasonable. The combined weight is shown in Eq. (6).

(6)
$ w_{j} =\frac{\sqrt{w_{1j} \times w_{2j} } }{\sum _{j=1}^{n}\sqrt{w_{1j} \times w_{2j} } } . $

In Eq. (6), $j$ refers to the number of indicator items, $w_{j} $ means the combination weight, $w_{1j} $ means the subjective weight, and $w_{2j} $ means the objective weight. Finally, the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) is utilized to conduct a comprehensive evaluation of 3D printing technology for relief craft. The overall model structure is shown in Fig. 2.

Fig. 2. Overall model structure.

../../Resources/ieie/IEIESPC.2025.14.2.242/image2.png

This method judges the quality of an object by evaluating its relative proximity to positive and negative ideal solutions. The ideal point method can fully utilize evaluation indicator data, reduce information loss, and improve the reliability of evaluation results. The operation steps are as follows: establishing a decision matrix, performing dimensionless data processing, calculating the weighted normalized decision matrix, obtaining positive and negative ideal solutions, and calculating the distance between the evaluation scheme and the positive and negative ideal solutions. The positive ideal solution is shown in Eq. (7).

(7)
$ z_{i}^{+} =\max \left(z_{i1}^{+} ,~z_{i2}^{+} ,~\ldots ,~z_{im}^{+} \right) . $

In Eq. (7), $z_{ij} $ represents the weighted data. The negative ideal solution is denoted in Eq. (8).

(8)
$ z_{i}^{-} =\min \left(z_{i1}^{-} ,~z_{i2}^{-} ,~\ldots ,~z_{im}^{-} \right) . $

By combining the indicator weights obtained from the Q-learning algorithm-based AI evaluation method with the evaluation results calculated by the TOPSIS method, an analysis is conducted in a non-integrated evaluation model. This model can quantitatively evaluate the accuracy, efficiency, and product quality of 3D printing technology in relief technology, thereby promoting the application of 3D printing technology in relief craft. The distance from the object to the positive ideal solution is shown in Eq. (9).

(9)
$ D_{i}^{+} =\sqrt{\sum _{j=1}^{n}(z_{ij} -z_{i}^{+} )^{2} } ,~i=1,~2,~\ldots ,~m. $

The distance from the object to the negative ideal solution is shown in Eq. (10).

(10)
$ D_{i}^{-} =\sqrt{\sum _{j=1}^{n}(z_{ij} -z_{i}^{-} )^{2} } ,~i=1,~2,~\ldots ,~m. $

The comprehensive evaluation value is indicated in Eq. (11).

(11)
$ A_{i} =\frac{D_{i}^{-} }{D_{i}^{+} +D_{i}^{-} } ,~i=1,~2,~\ldots ,~m . $

This model has broad application prospects, which can be utilized to assess the performance of different 3D printers in the application of relief craft, providing support for the selection and optimization of 3D printing technology in relief craft.

3.2 Design of 3D Printing Path Planning Optimization Model Based on Q-Learning Algorithm

After completing the AI-driven evaluation model for 3D printing technology in relief technology, a 3D printing path planning optimization model based on Q-learning algorithm is constructed. Firstly, based on the principles of path planning timing optimization, the model hypothesis is established and an objective function is established using the principles of accuracy, efficiency, cost, and stability. At the same time, the balance constraint between printing accuracy and speed are set. Subsequently, under model constraints, the Q-learning algorithm is optimized and combined with multi-objective decision-making techniques to address the challenges of 3D printing path planning. Finally, the evaluation model integrates AI experience replay, selects the global optimal path through standardized objective function, local optimal path search and comparison, and formulates the optimal path plan. Under the comprehensive influence of numerous influencing factors, a 3D printing path planning optimization model based on Q-learning algorithm is constructed. Firstly, based on the principle of path planning timing optimization, the following model assumptions are established as shown in Fig. 3.

Fig. 3. Model assumptions.

../../Resources/ieie/IEIESPC.2025.14.2.242/image3.png

According to the optimization principles, the study adopts the principles of accuracy, efficiency, cost, and stability to establish the objective function. The accuracy principle is to maximize printing accuracy and ensure that the details of the relief craft can be accurately presented. The efficiency principle is to minimize the total printing time and improve the efficiency of 3D printing operations. The cost principle is manifested in reducing material consumption and maintenance cost during the 3D printing process. The stability principle is manifested in ensuring the stable operation of the 3D printing process, in order to reduce the risk of printing failure. In terms of model constraints, the balance between printing accuracy and speed is taken as the priority constraint.

The printing accuracy constraint is shown in Eq. (12).

(12)
$ \sum _{j=1}^{\sigma }\sum _{k=1}^{K}\gamma _{j,k,t} \times E_{j,k,t} \times \left(1-\varepsilon _{j,k,t} \right)\ge D_{t} . $

In Eq. (12), $D_{t} $ represents the 3D printing project of Class $t$ relief craft, and $\varepsilon _{j,k,t} $ represents the accuracy standard required for the printing task of Class $j$ and $k$ projects in the $t$th printing task. The printing efficiency constraint needs to satisfy Eq. (13).

(13)
$ \sum _{j=1}^{\sigma }\sum _{k=1}^{K}\frac{\gamma _{j,k,t} \times E_{j,k,t} \times \left(1-\varepsilon _{j,k,t} \right)}{H_{j,k,t} } \ge \left(1+r\right)L_{\max ,t}^{D} . $

In Eq. (13), $L_{\max ,t}^{D} $ represents the maximum printing task quantity, $H_{j,k,t} $ represents the printing time, and $r$ represents the printing efficiency. The material consumption constraint needs to satisfy Eq. (14).

(14)
$ j=\sum _{j=1}^{\sigma }P_{j,t} \le P_{j,t,\max } . $

In Eq. (14), $P_{j,t} $ means the total material consumption, and $P_{j,t,\max } $ means the maximum allowable material consumption. The stability constraint for printing needs to meet Eq. (15).

(15)
$ \lambda _{j} -p_{j} \times t\le \frac{E_{j,t} }{\sum _{j=1}^{\sigma }E_{j,t} } \le \lambda _{j} +z_{j} \times t . $

In Eq. (15), $\lambda _{j} $ means the stability index of the task, $p_{j} $ means the upper limit of stability decline rate, $E_{j,t} $ means the amount of printing tasks, and $z_{j} $ means the upper limit of stability increase rate. The research aims to address the challenges of 3D printing path planning by optimizing the Q-learning algorithm and combining multi-objective decision-making techniques. In the optimization process of 3D printing technology in relief technology, a quantitative evaluation objective function is established to accurately measure the material saving effect. According to the material consumption constraint, the objective function can be defined, as shown in Eq. (16).

(16)
$ Minimize~ F_{m} =\sum _{i=1}^{N}\frac{M_{i} }{M_{\max } }. $

In Eq. (16), $F_{m} $ represents the objective function of material utilization savings. $N$ represents the total number of printing tasks. $M_{i} $ represents the actual material consumption of the $i$-th printing task. $M_{\max } $ represents the maximum allowable material consumption. This objective function accumulates the ratio of the actual material consumption $M_{i} $ to the maximum allowable consumption $M_{\max } $ for all printing tasks to form a comprehensive indicator, thereby quantifying the material saving effect. The optimization goal is to minimize the total material consumption ratio while meeting the process requirements, in order to achieve the optimization principle of cost reduction. Abandoning traditional path strategies and incorporating AI experience replay to enable learning to occur on specific trajectories, ensuring the integrity, robustness, and non-redundancy of action reward encoding. The encoding reflects the path selection and its impact on printing accuracy and efficiency, and adjusts algorithm parameters such as learning rate, discount factor, and exploration rate. The algorithm process is shown in Fig. 4.

Fig. 4. Improved Q-learning algorithm process.

../../Resources/ieie/IEIESPC.2025.14.2.242/image4.png

In Fig. 4, the evaluation model incorporates AI experience replay to ensure that learning takes place on specific trajectories, making the action reward encoding complete and robust. By standardizing the objective function, the dimensions of printing accuracy, speed, and material consumption are eliminated to unify the evaluation system. After the initial strategy is generated, the strategy with the highest reward value is directly calculated and retained. The learning update rules of the Q-learning and $\varepsilon$-greedy algorithm are applied to local optimal path search. Before reaching the predetermined number of learning times, continuous exploration and optimization are carried out. Finally, all local optimal solutions are compared to select the global optimal path and formulate the best path plan. This model significantly improves the accuracy and efficiency of path planning by improving various parameters such as learning rate, discount factor, and exploration rate.

4. Evaluation Model for 3D Printing Technology in Relief Craft

By combining Q-learning algorithm and AI to optimize the evaluation model of 3D printing technology in relief craft, key parameters of the printing process are analyzed through AI. Then dynamic weights are assigned to these parameters using Q-learning algorithm to achieve iterative optimization. By comparing algorithmic differences, optimization solutions for path planning are selected to improve accuracy and efficiency in practical 3D printing applications, expecting to provide an effective evaluation tool for the fine printing control of relief technology.

4.1 Analysis of Q-Learning Algorithm Optimization Effect in Evaluation Model

To ensure the accuracy of the experimental outcomes, the research is conducted on the testing and optimization analysis of the Q-learning algorithm using a cloud server platform. In the 3D printing technology of relief craft, Q-learning algorithm is used for path planning. Through a cloud server platform, an AI optimized path planning environment is simulated during the 3D printing process. The Q-learning algorithm is carefully adjusted and tested for the learning and decision-making processes of different relief patterns. Key algorithm parameters include reward function, learning rate, and discount factor. The hardware configuration includes optimizing the processor, memory, and storage space of the cloud server to ensure efficient data processing. Machine learning frameworks and programming environments are selected for software configuration. The details of software and hardware configuration and model parameter settings are shown in Table 4. Table 4 contains the software and hardware configurations and model parameter settings for the cloud server platform used for Q-learning algorithm testing and optimization analysis. The hardware configuration includes key components such as high-performance processors and high-capacity memory to ensure efficient data processing of 3D printing embossing technology in the Q-learning algorithm path planning process. The software configuration selects an adapted programming environment and machine learning framework. In addition, the parameter adjustment of the Q-learning algorithm is also set to meet the complex process requirements of 3D printing.

Table 4. Details of software and hardware configuration and model parameter settings.

Set up

Details

Hardware configuration

Cloud instance

Platforms provided by cloud service providers

Instance type

Optimization for AI computing

CPU

High performance multi-core processor (16 cores, 3.0GHz)

RAM

64GB high-capacity memory

operating system

Linux distribution (Ubuntu 20.04)

Software configuration

programming language

Python 3.8

Mathematical Library

NumPy 1.21, SciPy 1.7

AI framework

TensorFlow 2.6

Path planning module

Customized Q-learning module

data management

TensorFlow 2.6

version control

Git 2.30

Parameter settings

Learning rate

zero point zero one

Reward mechanism

Customizing the accuracy of relief artworks

Training rounds

one thousand

State representation

Encoding to capture the features of relief art

Discount factor

zero point nine five

Exploring strategies

Epsilon Green algorithm (ε=0.1)

Termination conditions

Converge or reach the maximum number of training rounds, such as 1000 rounds)

Action set

Define for 3D printing path planning

To conduct performance testing on the evaluation model, the average convergence of Q-learning algorithm in the application of relief craft path planning in 3D printing technology was tested, which was also compared with Improved Heuristic-based Search Algorithm (IHSA) and Ant Colony Optimization for Path Planning (ACOPP) for comparative testing. The average fitness value of the model was used as the indicator for testing. The test results are expressed in Fig. 5.

Fig. 5. Average convergence performance test results of three models.

../../Resources/ieie/IEIESPC.2025.14.2.242/image5.png

In Fig. 5, the Q-learning algorithm optimization model used in the study exhibited fast convergence speed and excellent average fitness. The optimal average fitness value of the model reached $1.519\times10^{-3}$. The results indicates that compared with other comparative models, the Q-learning algorithm has significantly improved optimization speed and efficiency.

According to the research on 3D printing technology path planning in the predicted relief craft by 2030, if no optimization measures were taken, the efficiency and accuracy of path planning would show a downward trend, as shown in Fig. 6.

Fig. 6. Overall emission reduction effect.

../../Resources/ieie/IEIESPC.2025.14.2.242/image6.png

In Fig. 6, the parameter adjustments for the application mainly include expanding the state space, optimizing the reward mechanism, enhancing learning rate regulation, and balancing exploration and utilization. The technical roadmap includes basic algorithm analysis, integration of emerging Q-learning technologies, combination of AI assistance methods, and implementation of modular components. From the data in Fig. 6, the path planning of the original algorithm took 8 hours and consumed 250 kilowatt hours. All four Q-learning optimizations improved planning efficiency and accuracy, but "simple parameter adjustment" did not significantly improve. The time was only reduced to 7.5 hours, and the energy consumption was decreased to 245 kilowatt hours. This reflects that algorithm adjustments have not fully adapted to complex processes, and there is a lack of in-depth research on the synergistic benefits of Q-learning and AI. The process optimization effect is shown in Fig. 7.

Fig. 7. Overall emission reduction effect.

../../Resources/ieie/IEIESPC.2025.14.2.242/image7.png

As shown in Fig. 7, for path planning time, path length, and energy consumption, the model optimized by AI algorithm had the most significant emission reduction effect in scenarios with high relief complexity, followed by scenarios with model structure adjustment. The priority order for optimizing the potential of material waste and unnecessary energy consumption in 3D printing technology was: algorithm structure optimization scenario$\mathrm{>}$AI model training and improvement scenario$\mathrm{>}$parameter fine-tuning scenario. For printing speed, only AI model training could improve efficiency in scenarios, while fine-tuning parameters in scenarios might lead to performance degradation. In order to further verify the application effect of Q-learning algorithm in 3D printing technology of relief technology, a comparative analysis is conducted by introducing traditional genetic algorithm (GA) [21] and PSO [22]. The results are shown in Table 5.

Table 5. Comparison of Q-learning algorithm and other methods for path optimization.

Relief craftsmanship

Q-learning duration (Hours)

Q-learning accuracy (%)

GA duration (Hours)

GA accuracy (%)

PSO duration (Hours)

PSO accuracy (%)

1

3.2

96.13

5.4

90.75

6.2

91.44

2

4.5

86.11

7.8

82.20

8.4

83.49

3

0.4

87.24

0.6

85.30

0.8

86.23

4

6.9

90.23

11.3

88.50

12.1

87.02

5

7.2

94.23

11.6

91.80

13.4

91.18

6

5.2

86.12

7.0

83.70

7.5

84.12

7

3.7

87.12

5.1

85.20

5.7

85.43

8

3.0

88.23

4.1

86.40

4.6

87.11

9

5.4

95.12

8.6

91.00

9.3

91.13

10

1.1

95.11

1.9

89.80

2.2

89.56

11

0.6

94.26

0.9

91.50

1.1

92.11

12

3.8

96.25

5.7

92.10

6.5

91.35

In Table 5, the performance of Q-learning was significantly better than GA and PSO. In relief process 1, the optimization time of Q-learning was 3.2 hours, with an accuracy of 96.13%, showing significant advantages compared with GA's 5.4 hours and 90.75%, as well as PSO's 6.2 hours and 91.44%. By adaptively learning path planning strategies, Q-learning can converge to the optimal solution faster, reduce the time required for path planning, and improve accuracy. The results of relief process 5 further validated the superiority of Q-learning, with an optimization time of 7.2 hours and an accuracy of 94.23%, which was much better than GA's 11.6 hours and 91.80%, as well as PSO's 13.4 hours and 91.18%. Q-learning enables path planning to achieve high accuracy in a short period of time by continuously iterating and updating strategies. In addition, the optimization results of relief process 9 also showed that Q-learning had a significant improvement in path planning accuracy, with an optimization time of 5.4 hours and an accuracy of 95.12%. It effectively improves the accuracy and efficiency of path planning by continuously adjusting strategies during the learning process. Although the relief process 6 performed average in terms of accuracy improvement, only improving by 0.4%, its time efficiency increased by 14.2 hours, still demonstrating a significant advantage of Q-learning in time optimization. Similarly, the time efficiency improvement of relief process 3 was relatively low, only 1.2 hours, but its accuracy was increased by 1.3%, indicating that Q-learning can also play an important role in small-scale optimization. Therefore, Q-learning performs better than GA and PSO in optimizing the 3D printing path of relief technology, demonstrating its effectiveness in relief technology 3D printing.

4.2 Analysis of the 3D Printing Model Effect of AI Assisted Relief Craft

After analyzing the 3D printing model effect of the relief craft assisted by Q-learning algorithm, the iterative effect of the model is first analyzed. Then the performance of the technology in practical applications is analyzed. The iterative process is shown in Fig. 8.

Fig. 8. Iteration status.

../../Resources/ieie/IEIESPC.2025.14.2.242/image8.png

In Fig. 8, after 40 iterations, the three main evaluation criteria for path planning no longer showed significant changes, indicating that the AI assisted 3D printing path planning for relief craft reached a stable state of optimal solution. From the perspective of path planning performance analysis, the number of iterations significantly affected the speed of reaching a stable optimal solution. Therefore, adjusting the number of iterations appropriately in the Q-learning algorithm iteration craft is beneficial for achieving a stable optimal solution faster. The comparison of path optimization effects in 3D printing is shown in Table 6.

In Table 6, in the 3D printing relief craft, AI assisted path planning utilized Q-learning algorithm to improve the efficiency of printing path and material usage. Although Q-learning and improved GA had similar results in material consumption and printing time, demonstrating the ability to save materials and improve efficiency, Q-learning may be higher in total cost than GA, indicating that it may not necessarily be the best option in cost savings. The limitations of Q-learning in timing optimization may lead to non global optimal solutions. In addition, it shows limitations in considering all variables in the printing process and dealing with multi-objective optimization problems, while the improved GA can more comprehensively balance path, material, and time, providing a more effective and economical relief 3D printing solution. The efficiency of the Q-learning algorithm in 3D printing path planning is shown in Fig. 9.

Table 6. Comparison of path optimization effects in 3D printing.

Indicator type

Index

Q-learning optimization

GA optimization [21]

Difference

Quantitative indicators

Material usage (kg)

110.00

110.00

0.00

Printing time (hours)

72.00

70.24

2.00

System cost (1000 yuan)

95.36

90.25

5.01

Plan timing distribution

2021

-

-

-

2022

-

-

-

2023

Path A, Path C

Path A, Path B

-

2024

Path D, Path F

Path D, Path E

-

2025

Path G

Path H

-

Fig. 9. Q-learning algorithm efficiency in 3D printing path planning.

../../Resources/ieie/IEIESPC.2025.14.2.242/image9.png

In Fig. 9, the Q-learning algorithm demonstrated its potential in terms of 3D printing path planning efficiency and material usage. This algorithm performed well in finding the optimal printing path and minimizing material usage. In addition, the Q-learning algorithm exhibited strong convergence ability and high solving efficiency during the iteration process, which is crucial for dealing with complex multi-objective optimization problems. Although there may be room for improvement, the current results have demonstrated the effectiveness of the Q-learning algorithm in path planning for 3D printing models in relief technology.

5. Conclusion

In 3D printing and relief craft, the efficiency and accuracy of path planning are crucial for productivity and product quality. In response to this challenge, an evaluation model combining Q-learning algorithm and AI is adopted to optimize path planning. Through multi-objective optimization, this algorithm could automatically adjust the printing path, reduce time and material consumption, while ensuring high-quality finished products. The experimental results reflected the effectiveness of AI technology in improving path planning performance. The data showed that the printing time of relief craft 11 was shortened by 1.9 hours, and the accuracy was improved by 1.3%. The time efficiency of the relief craft 10 was improved by 3.8 hours, and the accuracy was increased by 3.6%. This progress demonstrated the potential of using AI for 3D printing path optimization. The main contribution of the evaluation model is to improve printing efficiency and reduce material waste and post-processing by improving accuracy. It provides an effective path optimization reference for solving complex 3D printing tasks. However, the computational cost of the model in dealing with high complexity structures is still high. Its applicability needs to be further validated for different printing materials and technologies. Future research will focus on improving the computational efficiency of the algorithm, adapting to more complex printing tasks, and exploring the applicability in various 3D printing technologies. It should strive to improve the universality of the model, adapt to a wider range of materials and devices, and promote the development of 3D printing towards automation and intelligence.

Fundings

The research is supported by: the Teaching Research and Reform Project of Changsha University of Science & Technology (Grant number XJG24-096).

REFERENCES

1 
M. Mohammadi, G. Becker, S. Diener, J. M. Tulliani, N. Katsikis, and P. Palmero, ``Robocasting of dense zirconia parts using commercial yttria-stabilized zirconia granules and ultrafine particles. Paste preparation, printing, mechanical properties,'' Ceramics International, vol. 48, no. 2, pp. 1936-1946, 2022.DOI
2 
B. Dietemann, L. Wahl, N. Travitzky, H. Kruggel-Emden, T. Kraft, and C. Bierwisch, ``Reorientation of suspended ceramic particles in robocasted green filaments during drying,'' Materials, vol. 15, no. 6, pp. 2100-2111, 2022.DOI
3 
J. Bonilla-Cruz, M. A. Ávila-López, F. E. L. Rodríguez, A. Aguilar-Elguezabal, and T. E. Lara-Ceniceros, ``3D printable ceramic pastes design: Correlating rheology & printability,'' Journal of the European Ceramic Society, vol. 42, no. 13, pp. 6033-6039, 2022.DOI
4 
E. G. de Moraes, I. M. Ferreira, L. B. Teixeira, L. H. Cartapati, M. T. Souza, and A. O. N. de Oliveira, ``Additive manufacturing of cellular structures from recycled soda-lime glass printing inks by robocasting,'' Ceramics International, vol. 49, no. 4, pp. 6554-6562, 2023.DOI
5 
O. Fagbohungbe, S. R. Reza, X. Dong, and L. Qian, ``Efficient privacy preserving edge intelligent computing framework for image classification in IoT,'' IEEE Transactions on Emerging Topics in Computational Intelligence, vol. 6, no. 4, pp. 941-956, 2021.DOI
6 
S. Srivastava, A. V. Divekar, C. Anilkumar, I. Naik, V. Kulkarni, and V. Pattabiraman, ``Comparative analysis of deep learning image detection algorithms,'' Journal of Big Data, vol. 8, no. 1, pp. 1-27, 2021.DOI
7 
J. Li, R. Yan, Y. Yang, and F. Xie, ``Water-based binder preparation and full-color printing implementation of a self-developed 3D printer,'' Rapid Prototyping Journal, vol. 27, no. 3, pp. 530-536, 2021.DOI
8 
H. Yang, L. Fang, Z. Yuan, X. Teng, H. Qin, Z. He, and M. Wu, ``Machine learning guided 3D printing of carbon microlattices with customized performance for supercapacitive energy storage,'' Carbon, vol. 201, no. 1, pp. 408-414, 2023.DOI
9 
D. M. Gruber, T. Perez, B. Q. Layug, M. Ohama, L. Tran, L. A. F. Rojas, and W. J. Miller, ``Three-dimensional printing of a model atomic force microscope to measure force-distance profiles,'' Journal of Chemical Education, vol. 97, no. 3, pp. 845-849, 2020.DOI
10 
I. Singhal and B. S. Balaji, ``Creating atom representations using open-source, stackable 3D printed interlocking pieces with tactile features to support chemical equation writing for sighted and visually impaired students,'' Journal of Chemical Education, vol. 97, no. 1, pp. 118-124, 2019.DOI
11 
H. Shen, X. Ye, G. Xu, L. Zhang, J. Qian, and J. Fu, ``3D printing build orientation optimization for flexible support platform,'' Rapid Prototyping Journal, vol. 26, no. 1, pp. 59-72, 2020.DOI
12 
M. A. Gallegos, C. M. Garcia, P. R. Schunk, B. C. White, B. L. Boyce, and E. B. Secor, ``Investigating porous media for relief printing using micro-architected materials,'' Advanced Engineering Materials, vol. 22, no. 11, 2000548, 2020.DOI
13 
N. D. Vorzobova and P. P. Sokolov, ``Formation and properties of volume and relief holographic gratings in photopolymer materials,'' Journal of Optical Technology, vol. 90, no. 3, pp. 114-118, 2023.DOI
14 
M. Maillard, J. Chevalier, L. Gremillard, G. P. Baeza, E. J. Courtial, S. Marion, and V. Garnier, ``Optimization of mechanical properties of robocast alumina parts through control of the paste rheology,'' Journal of the European Ceramic Society, vol. 43, no . 7, pp. 2805-2817, 2023.DOI
15 
I. S. Elizarova, L. Vandeperre, and E. Saiz, ``Conformable green bodies: Plastic forming of robocasted advanced ceramics, Journal of the European Ceramic Society, vol. 40, no. 2, pp. 552-557, 2020.DOI
16 
F. Álvarez, A. Cifuentes, I. Serrano, L. Franco, G. Fargas, F. Fenollosa, and J. J. Roa, ``Optimization of the sintering thermal treatment and the ceramic ink used in direct ink writing of $\alpha$-Al2O3: Characterization and catalytic application,'' Journal of the European Ceramic Society, vol. 42, no. 6, pp. 2921-2930, 2022.DOI
17 
E. Romanczuk-Ruszuk, B. Sztorch, D. Pakuła, E. Gabriel, K. Nowak, and R. E. Przekop, ``3D printing ceramics-materials for direct extrusion process,'' Ceramics, vol. 6, no. 1, pp. 364-385, 2023.DOI
18 
A. J. Martin, W. Li, J. Watts, G. E. Hilmas, M. C. Leu, and T. Huang, ``Particle migration in large cross-section ceramic on-demand extrusion components,'' Journal of the European Ceramic Society, vol. 43, no. 3, pp. 1087-1097, 2023.DOI
19 
A. Rosenberger, N. Ku, L. Vargas-Gonzalez, M. Alazzawi, and R. Haber, ``Rheology and processing of UV-curable textured alumina inks for additive manufacturing,'' International Journal of Applied Ceramic Technology, vol. 18, no. 5, pp. 1457-1465, 2021.DOI
20 
X. M. Long, Y. J. Chen, and J. Zhou, ``Development of AR experiment on electric-thermal effect by open framework with simulation-based asset and user-defined input,'' Artificial Intelligence and Applications, vol. 1, no. 1, pp. 52-57, 2023.DOI
21 
M. Wang, Y. Yang, G. Chi, K. Yuan, F. Zhou, L. Dong, H. Liu, Q. Zhou, W. Gong, S. Yang, and T. Tang, ``A 3D printed Ga containing scaffold with both anti-infection and bone homeostasis-regulating properties for the treatment of infected bone defects,'' Journal of Materials Chemistry B, vol. 23, no. 9, pp. 4735-4745, 2021.DOI
22 
S. C. Kiong and S. E. Teu, ``Optimization of process parameters for polylactic acid (PLA) of FDM using particle swarm optimization (PSO),'' Research Progress in Mechanical and Manufacturing Engineering, vol. 4, no. 1, pp. 205-217, 2023.DOI

Author

Ying Jiang
../../Resources/ieie/IEIESPC.2025.14.2.242/author1.png

Ying Jiang received her M.S. degree from Jingdezhen Ceramic University, Jingdezhen, China, in 2008. She is currently serving as a Lecturer in the School of Design and Art at Changsha University of Science and Technology. Her research centers on innovative design solutions and the application of advanced technologies within the field. Her work aims to optimize the design process and production of sculpture crafts, merging modern technological advancements with traditional craftsmanship. She has published articles in more than 10international reputed journals. Her research helps bridge the gap between innovative technology and practical design, driving progress in both aesthetic and functional aspects of product design.