<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JIS</journal-id><journal-title-group><journal-title>Journal of Information Security</journal-title></journal-title-group><issn pub-type="epub">2153-1234</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jis.2023.144020</article-id><article-id pub-id-type="publisher-id">JIS-128204</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  An Extensive Study and Review of Privacy Preservation Models for the Multi-Institutional Data
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Sagarkumar</surname><given-names>Patel</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Rachna</surname><given-names>Patel</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ashok</surname><given-names>Akbari</given-names></name><xref ref-type="aff" rid="aff3"><sup>3</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Srinivasa</surname><given-names>Reddy Mukkala</given-names></name><xref ref-type="aff" rid="aff4"><sup>4</sup></xref></contrib></contrib-group><aff id="aff3"><addr-line>Department of Pharmacy, Shree Naranjibhai Lalbhai Patel College of Pharmacy, Umrakh, Surat, India</addr-line></aff><aff id="aff1"><addr-line>Department of Biometrics, LabCorp Drug Development Inc., Somerset, USA</addr-line></aff><aff id="aff2"><addr-line>Department of Biometrics, Catalyst Clinical Research LLC, Wilmington, USA</addr-line></aff><aff id="aff4"><addr-line>Department of Biostatistics, EpisData, Sterling Heights, USA</addr-line></aff><pub-date pub-type="epub"><day>08</day><month>08</month><year>2023</year></pub-date><volume>14</volume><issue>04</issue><fpage>343</fpage><lpage>365</lpage><history><date date-type="received"><day>15,</day>	<month>September</month>	<year>2023</year></date><date date-type="rev-recd"><day>7,</day>	<month>October</month>	<year>2023</year>	</date><date date-type="accepted"><day>10,</day>	<month>October</month>	<year>2023</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  The deep learning models hold considerable potential for clinical applicatio
  ns, 
  but 
  there are many challenges to successfully training deep learning models. Large-scale data collection is required, which is frequently only possib
  le through multi-institutional cooperation. Building large central repositories is one strategy for multi-institution studies
  .
   However
  ,
   this is hampered by issues regarding data sharing, including patient privacy, data de-identification, regulation, intellectual property, and data storage. These difficulties have lessened the impracticality of central data storage. In this survey, we will look at 24 research publications that concentrate on machine learning approaches linked to privacy preservation techniques for multi-institutional data, highlighting the multiple shortcomings of the existing methodologies. Researching different approaches will be made simpler in this case based on a number of factors, such as performance measures, year of publication and journals, achievements of the strategies in numerical assessments, and other factors. A technique analysis that considers the benefits and drawbacks of the strategies is additionally provided. The article also looks at some potential areas for future research as well as the challenges associated with increasing the accuracy of privacy protection techniques. The comparative evaluation of the approaches offers a thorough justification for the research’s purpose.
 
</p></abstract><kwd-group><kwd>Privacy Preservation Models</kwd><kwd> Multi Institutional Data</kwd><kwd> Bio Technologies</kwd><kwd> Clinical Trial and Pharmaceutical Industry</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Currently, the world is increasingly witnessing technological innovations such as big data, nanotechnology, cloud computing, biotechnologies, artificial intelligence, and the Internet of Things (IoT), which collectively form part of the fourth industrial revolution [<xref ref-type="bibr" rid="scirp.128204-ref1">1</xref>] . Furthermore, the fourth industrial revolution has brought in fast-paced development in commercial and governmental organizations alike, affecting everyday pursuits worldwide [<xref ref-type="bibr" rid="scirp.128204-ref2">2</xref>] . Chapter two discusses the works carried out by various researchers to assure secure data preservation in health care applications. Existing literature suggests that Anonymization-based models, Blockchain-based models, and optimization approaches can be used synergistically to address the issues concerning the secure provision of e-health data.</p><p>IoT is one such present-day innovation that has percolated routine life through various applications, ensuring a safe, innovative, and more accessible environment [<xref ref-type="bibr" rid="scirp.128204-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref4">4</xref>] . IoT enables the interaction between humans as it integrates millions of people, intelligent nodes, substantial objects, services, and digital sensors; it also consists of millions of digital sensors [<xref ref-type="bibr" rid="scirp.128204-ref5">5</xref>] . The significant feature of the IoT is that all the objects in the environments are interlinked to each other as it transfers the data in any part of the world at any time [<xref ref-type="bibr" rid="scirp.128204-ref6">6</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>] . Modern innovations such as big data, cloud computing, fog computing, allocated computing, and wireless communication aid the IoT to attain its intention of facilitating the interaction between intelligent [<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref11">11</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref12">12</xref>] . IoTs are used in various applications and domains, such as coordination, transportation, medical care, well-being, insight, and many more applications.</p><p>The IoT devices are able to produce enormous amounts of data known as big data due to a wide range of applications including transportation, smart homes, the health care industry, and electricity conservation [<xref ref-type="bibr" rid="scirp.128204-ref13">13</xref>] . The following section briefly elucidates IoT big data. IoT gadgets are known to involve copious amounts of private and sensitive information. For example, Cisco expects that 500 billion devices will be connected to the internet by 2025 [<xref ref-type="bibr" rid="scirp.128204-ref14">14</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref15">15</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref16">16</xref>] . As a result, the amount of structured and unstructured data has increased by 2.5 Exa bytes daily [<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>] . On the other hand, the worldwide server farm’s IP traffic would only arrive at 10.4 zetta bytes [<xref ref-type="bibr" rid="scirp.128204-ref18">18</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref19">19</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref20">20</xref>] . This rapid expansion in information volume is attributed to web administrations, versatile information, and medical care information [<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>] .</p><p>With the rapid adoption of global IoT-connected devices, enormous amounts of data are transferred between cloud-based and physical network environments. It has also brought in many technologies that create a vision of interconnecting the world through devices. A plethora of privacy and security challenges are witnessed in IoT architectures [<xref ref-type="bibr" rid="scirp.128204-ref22">22</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref23">23</xref>] . Generally, IoT-based healthcare applications hold sensitive medical details of the patients, for which confidentiality is necessary to ensure the privacy of the patients. Due to challenges associated with digital data, conventional encryption strategies over structural and textual one-dimensional data are not used for e-health data directly. In addition, when sensitive information is forwarded through open channels, patients may suffer from the loss of information contents. Hence, a secure key frame extraction strategy is needed to ensure appropriate privacy-preserving e-health services. The multi-institutional clinical data yield enough information to identify the small differences and improve general ability.</p><p>Between 2017 and 2019, data breach incidents rose from 15% to 26%, as reported by professionals involved in risk oversight activities [<xref ref-type="bibr" rid="scirp.128204-ref24">24</xref>] . The healthcare industry is turning to big data technology to improve and manage medial systems. For this purpose, healthcare companies and organizations are leveraging big data in health informatics [<xref ref-type="bibr" rid="scirp.128204-ref25">25</xref>] . The security gap where the question arises is whether the third-party policies and safeguards regarding IoT security are sufficient for preventing data breaches which is one of the main reasons for the rising IoT threats. Recently, sophisticated hacks on IoT devices have aggravated the problem, and therefore, the privacy and security problems in the IoT paradigm are discussed in detail. In this chapter, the existing pool of literature that is relevant to the research topic will be reviewed by surveying academic journals, technical reports and data, books, scholarly articles, and other relevant publications. The secondary literary sources will be reviewed and analyzed thoroughly that are relevant to the privacy and data security management in healthcare and clinical research, the role of IoT and big data analytics (BDA), as well as the risks and challenges concerning data privacy and security management in IoT and big data analytics.</p>Deep Learning Models in Clinical Applications<p>In recent years, the integration of deep learning models into clinical applications has ushered in a new era of healthcare innovation. These advanced computational tools have demonstrated immense potential in revolutionizing disease diagnosis, treatment planning, and patient care [<xref ref-type="bibr" rid="scirp.128204-ref26">26</xref>] . The ability of deep learning algorithms to analyze complex medical data, such as medical images, genomic sequences, and electronic health records, has opened up a plethora of opportunities for improving clinical outcomes and healthcare delivery [<xref ref-type="bibr" rid="scirp.128204-ref27">27</xref>] . Deep learning models, particularly convolutional neural networks (CNNs) and recurrent neural networks (RNNs), have shown remarkable capabilities in tasks like medical image classification, natural language processing, and predictive modeling. From the early detection of tumors in radiological scans to the personalized treatment recommendations based on genetic profiles, the impact of these models on the medical field is profound [<xref ref-type="bibr" rid="scirp.128204-ref28">28</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref29">29</xref>] . They offer the potential to augment the expertise of healthcare professionals, expedite diagnosis, reduce errors, and enhance patient outcomes. However, as the healthcare industry embraces the transformative potential of deep learning, it also faces a critical challenge: the protection of patient rights and data security. While deep learning algorithms excel at extracting insights from medical data, the sensitivity and confidentiality of patient information must not be compromised. Ensuring that the healthcare ecosystem remains a trusted guardian of individual privacy is paramount.</p></sec><sec id="s2"><title>2. Literature Review</title><p>Categorization of the Privacy Preservation Models:</p><p><xref ref-type="fig" rid="fig1">Figure 1</xref> shows the schematic block diagram representing the categorization of the privacy preservation models in e-health data.</p><sec id="s2_1"><title>2.1. Schemes Concerning Anonymization N-Based Techniques</title><p>A presented a revolutionary system called Spark that makes use of Apache Spark to efficiently manage large amounts of health care data as well as K-anonymization and L-diversity to protect sensitive personal data. Furthermore, the developed strategy ensures that the shared e-health data does not reveal or isolate the original data before moving to the Hadoop distributed file system (HDFS) [<xref ref-type="bibr" rid="scirp.128204-ref5">5</xref>] .</p><p>In a study, two solutions are offered that protect user privacy in parking recommender systems while analyzing the past parking history utilizing k-anonymity (anonymization) and differential privacy (perturbation) techniques [<xref ref-type="bibr" rid="scirp.128204-ref6">6</xref>] . The k-anonymity mechanism specifically creates an anonymized database from an original parking database that contains users’ parking information. The users are indistinguishable in both methods due to differential privacy, which also perturbs the Laplace mechanism’s query response. It is now possible for customers to receive parking place recommendations while maintaining their privacy due to experimental findings on a data set built from actual parking measurements [<xref ref-type="bibr" rid="scirp.128204-ref6">6</xref>] .</p><p>In order to use Fully Homomorphic Encryption (FHE) schemes, a presented proxy re-ciphering as a service employing traditional methods such as threshold secret sharing, distributed semi-trusted proxy servers, and chameleon hash function [<xref ref-type="bibr" rid="scirp.128204-ref7">7</xref>] . The effectiveness of the developed strategy is analyzed using real-world data. Furthermore, the strategy’s security characteristics are also analyzed over the general cyber threats, ensuring that the developed method is a sensible, scalable, and easy-to-use strategy for the long-term prevention of sensible data [<xref ref-type="bibr" rid="scirp.128204-ref7">7</xref>] .</p><p>A method to develop a productive anonymous algorithm to maintain the privacy of private data by the data owners [<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>] , privacy frequently occupies a chief role in data mining strategies, and a number of anonymous algorithms introduce privacy in digging data. However, there are limits to privacy protection, and hence a novel strategy of Efficient Anonymous Algorithm (NEAA) is introduced. At first, the entity process’s raw data and the data are preserved in the database. Then, the sensitive data is evaluated using the PCA-based Attribute selection algorithm. The hiding process introduces a novel algorithm based on Anonymous (NBA). Finally, anonymous data can be produced as a result [<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>] . The performance of the developed system is analyzed and found to provide enhanced performance compared to conventional systems [<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>] .</p><p>A presented a privacy-preserving protocol based on keyword search with security for EHR system suggested [<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] . Through the use of a keyword search in the cipher text, which is then once more re-encrypted by the cloud using the re-encryption key created by the patient, this method can quickly identify the history of health records that are related. It guarantees that confidential information cannot be disclosed by unauthorized users. An entity-based access control system ensures that only the intended data requester has access to the patients’ medical records [<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] .</p><p>A model to recognize how the connection of records can assist in developing the entire patient profiles and thus adjoin importance to the conventional healthcare systems was presented by [<xref ref-type="bibr" rid="scirp.128204-ref11">11</xref>] . The usage of data anonymity shows how privacy is fulfilled by specifying the knowledge about the background and further limiting access to factual data. A semantic strategy counting policy formalization, compliance examination, and knowledge discovery was carried out to prevent the risks related to privacy with arbitrary linkages [<xref ref-type="bibr" rid="scirp.128204-ref11">11</xref>] .</p><p>The tuple partitioning approach is used in an effective quasi-identifier index-based architecture for privacy preservation over incremental databases on the cloud design [<xref ref-type="bibr" rid="scirp.128204-ref12">12</xref>] here the merging of columns generates anonymous information for the user. In addition, a packetization strategy is applied to enhance the effectiveness of privacy-preserved information further. The outcomes of the method over a real-world database have proved that the developed model effectively preserves the incremental database of large volumes compared to conventional strategies [<xref ref-type="bibr" rid="scirp.128204-ref12">12</xref>] .</p></sec><sec id="s2_2"><title>2.2. Schemes Concerning Blockchain-Based Models</title><p>A blockchain structure for managing electronic health records (EHR) could provide the patient with rights and control over the EHRs. The Ancile structure also exposes a blockchain system that achieves a higher degree of decentralization while admitting some nodes as having greater power. According to the evaluation, it is very unlikely that all the data will be covered while maintaining a usable and interoperable model. Ancile, however, still offers a significant amount of data integrity and privacy protection due to the use of smart contracts to partition the data [<xref ref-type="bibr" rid="scirp.128204-ref13">13</xref>] .</p><p>A blockchain-based medical data preservation scheme (DPS) ensures the primitiveness and verifiability of EHRs while preserving privacy for the data owner [<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] . After receiving authorization from the data owner, the data requestor uses the expected keyword from the data provider to analyze the expected EHRs through the EHR consortium blockchain and obtains the re-encryption cipher text from the cloud server. To comprehend data security, privacy preservation, and access control, the technique largely uses searchable encryption and interim proxy re-encryption. Additionally, proof of agreement is developed as a compromise method for consortium blockchain to guarantee the availability of the system [<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] . In order to demonstrate the effectiveness of the proposed scheme in terms of computing efficiency, the cryptographic primitives were also emulated and the planned scheme was implemented on the Ethereum platform [<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] .</p><p>They suggested the blockchain-based interoperability problem in EHR and determined the number of cross-blockchain-based EHR storage strategies [<xref ref-type="bibr" rid="scirp.128204-ref10">10</xref>] . Initially, an EHR privacy-preserving cross-blockchain strategy was presented based on Polka Dot chain technology for EHR circulation among several private blockchains of various hospitals. The issue of removing EHR data from each hospital’s private blockchain is solved by RaaS. Using the developed mechanism constructs it beneficial and effective for the doctors to access the data and for patients to remove the unwanted EHR data when they visit various hospitals, which acts a significant function in the EHR privacy-preserving area and in contravening the remote islands of sharing medical data [<xref ref-type="bibr" rid="scirp.128204-ref10">10</xref>] .</p><p>The blockchain-based model was developed for the management of EHR on a distribution network to ensure the eventual privacy of patients’ health records, providing control to the patients to monitor the access of data by others through the developed method [<xref ref-type="bibr" rid="scirp.128204-ref15">15</xref>] . Gathering and organization of data into Big Data provide a huge possibility of varying the healthcare viewpoint, like in personalization care of patients, the discovery of the drug, the efficiency of the treatment, enhancement in clinical results, and the safety management of the patients. Furthermore, the blockchain offers a proposal for which the EHR of patients is preserved without tempering or any attacks. Then to ensure ultimate isolation and control over access to an EHR sheet on the blockchain, a channeling strategy ensures that the patients accept the entities only within a distributed network to completely access the data [<xref ref-type="bibr" rid="scirp.128204-ref15">15</xref>] .</p></sec><sec id="s2_3"><title>2.3. Schemes Concerning Intelligent Techniques</title><p>In order to protect patient privacy, the Privacy-Preserving Optimization of Clinical Pathway Query (PPO-CPQ) system uses a safe clinical pathway query on cloud servers for e-healthcare., and the sensitive data corresponding to the hospitals, like treatment, expense, and medication. Under this strategy, a secure and privacy-preserving sub-protocol is initially designed, which constitutes a comparison of privacy-preservation, selection of privacy-preserving stage, and so on, to assure the privacy of the e-Healthcare system. The query is then safely executed using the greedy algorithm, and the system’s efficiency is increased by using the Min-Heap technique [<xref ref-type="bibr" rid="scirp.128204-ref16">16</xref>] .</p><p>Big data privacy has been successfully preserved using an algorithm known as Grey Wolf Optimizer-Cat Swarm Optimization (GWO-CSO). The generated model, which is used to create the k-anonymization database in which k numbers of duplicate records are created within the real database [<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>] , is obtained by changing the update rules of GWO in the presence of the CSO algorithm. In order to provide safe data transmission over to end users, the newly created technique is used in the k-anonymized database to conceal the sensitive information relating to the data owners. The created technique ensures the k-anonymization phenomena by producing the parameters required to build the k-anonymized database ideally [<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>] . The privacy and utility metrics consider evaluating the fitness of the solutions obtained using the developed algorithm [<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>] .</p><p>Mandala and Rao presented the privacy preservation model for private healthcare data [<xref ref-type="bibr" rid="scirp.128204-ref18">18</xref>] . The designed model mainly concentrated on introducing an effective sanitizing model to conceal the sensitive information of users. A key was framed and selected optimally using the Adaptive Awareness Probability-based Crow search algorithm (AAP-CSA) approach to conceal the sensitive medical data. Moreover, the designed strategy was analyzed in terms of various attacks with different algorithms to show the effectiveness of the designed method [<xref ref-type="bibr" rid="scirp.128204-ref18">18</xref>] .</p></sec><sec id="s2_4"><title>2.4. Schemes Concerning Conventional Techniques</title><p>A privacy-preserving chaos-based privacy-preserving encryption model was designed by to protect patients’ privacy [<xref ref-type="bibr" rid="scirp.128204-ref19">19</xref>] . The developed model could protect the images of the patients from a cooperative broker. To secure the essential frames of the data gathered from the wireless capsule endoscopy strategy specifically, a quick probabilistic model was devised and a prioritization method was used. The created model produces encrypted images that are random in nature, which improves computational efficiency. The created methodology also entails processing medical data without any leaks, and protecting patient privacy by allowing only authorized users to access the data [<xref ref-type="bibr" rid="scirp.128204-ref19">19</xref>] .</p><p>Yang presented a practical and privacy-preserving prediction scheme to predict disease risk using e-healthcare, named EPDP [<xref ref-type="bibr" rid="scirp.128204-ref20">20</xref>] . In contrast to the current approaches, the created EPDP successfully completes the two steps of disease risk prediction, such as disease model training and disease prediction, while ensuring improved privacy preservation. Notably, a cryptographic approach is used with a super-increasing order to efficiently collect each disease’s symptoms throughout the disease model training phase. Results from the disease risk prediction phase were evaluated using the bloom filter approach.</p><p>Chamikara modeled a solution to maintain data privacy in significant data distribution and evaluation strategies, a challenge in smart cyber-physical models [<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>] . The developed algorithm called SEAL is used to preserve data privacy. The linear time complexity associated with SEAL assists in working with the continuously increasing big data and data streams effectively. The method attains increased accuracy in classification, scalability, and efficiency while maintaining enhanced privacy and higher attack resistance compared to other methods. The findings indicate that the approach is appropriate for smart cyber-physical environments, including grids, cars, healthcare systems, and homes because it can control the continuous data streams produced by sensors monitoring a single person or a group of people and processing them before sending them to cloud systems for additional analysis [<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>] .</p><p>The remote data integrity checking strategy used the fine-grained update for big data storage [<xref ref-type="bibr" rid="scirp.128204-ref22">22</xref>] . The developed strategy attains general processes of modification, insertion, and deletion at any online location level in a file with a mapping relation among the block-level update and the line-level update. The analysis of the method depicts that the developed strategy assists in privacy preservation and public verification. On the other hand, it carries out data integrity with a low reduced cost for communication and computation [<xref ref-type="bibr" rid="scirp.128204-ref22">22</xref>] .</p><p>To protect private information, a big data probabilistic approach based on clustering was deployed in such a way as to obtain maximum privacy and minimum perturbation. In this model, sensitive data is preserved after recognizing the confidential information from the data clusters to adjust or generalize them. The resultant database is examined to evaluate the level of accuracy of the model concerning hidden data and loose data due to reconstruction. Results demonstrate that the developed clustering-based Privacy preservation strategy in big data led to successful reconstruction [<xref ref-type="bibr" rid="scirp.128204-ref23">23</xref>] .</p></sec><sec id="s2_5"><title>2.5. Risk Outcomes towards Data Privacy and Security Management</title><p>In accordance with [<xref ref-type="bibr" rid="scirp.128204-ref24">24</xref>] , if safe encryption is used in devices or if the internet service provider or network observers analyze the internet traffic from the smart homes connected to the IoT devices, they can gather sensitive information about the activities that take place at home. To prevent the gadgets from becoming inoperable, the user must not block outgoing traffic from their residence, an example of privacy risk outcomes associated with IoT applications in smart homes. IoT’s privacy risks and threats can be user identification, user tracking, profiling, utility controlling, and monitoring. From the privacy perspective, the threat associated with user identification is the ability of the device to distinguish or reveal the identity of the person based on acquired data like name, address, or any such personal information. Such a threat aggravates other associated threats like tracking and profiling individuals’ behavior.</p><p>The high volume of healthcare data creates a big challenge, the desire for scalable storage and support for distributed queries across multiple data sources. Specifically, the challenge is being able to locate and mine specific pieces of data in an enormous, partially structured dataset [<xref ref-type="bibr" rid="scirp.128204-ref25">25</xref>] . The study results showed that privacy, familiarity, and security levels affected users’ trust in using IoT devices. Due to the privacy and security concerns associated with IoT, the amount of trust in devices also had an impact on people’s perceptions of risk and their attitudes towards using it.</p><p>According to a recent study conducted IoT devices are increasingly becoming pervasive in our everyday life, so it is important to understand the underpinning privacy and security risks associated with them [<xref ref-type="bibr" rid="scirp.128204-ref26">26</xref>] . These risk factors result in attacks by cyber attackers faced by consumers using IoT devices.</p><p>IoT has tremendous benefits but presents a reminder that IoT has security and privacy implications. The health data is susceptible, and its granularity poses significant challenges to the anonymization of personal information and thereby exposes consumers to data security and privacy risks. Unauthorized people can intrude into IoT data and use them in authorized ways. Moreover, the increased reliance upon big data and IoT-based devices heightens the risk of security threats and a vulnerability point for intruders to access users’ personal information [<xref ref-type="bibr" rid="scirp.128204-ref27">27</xref>] . In addition to security threats, IoT devices are vulnerable due to many factors. Firstly, the IT manufacturers of IoT devices are inexperienced regarding the risks related to data security relative to hardware or software items. Secondly, the security measures like encryption also need to be fully considered, and thirdly, it is hard to update these devices periodically with security fixes.</p></sec></sec><sec id="s3"><title>3. Analysis and Discussion</title><p>This section examines the method for multi-institution data that preserves privacy. The parts that follow analyze the study based on the parameter measurements.</p><sec id="s3_1"><title>3.1. Search Strategy</title><p>To conduct a comprehensive survey on the topic of privacy preservation for multi-institutional data, a systematic search strategy was employed across a diverse set of journals and sources. The search primarily focuses on the following key databases and journals: IEEE, Wiley, Elsevier, Arxiv, Future Generation Computer Systems, International Journal of Advanced Computer Science, Springer, and Berkeley Technology Law Journal. Additionally, relevant proceedings from the 2018 workshop and publications in the International Journal and Elementary Education are included.</p><sec id="s3_1_1"><title>3.1.1. Search String</title><p>The search string involves using a combination of relevant keywords and phrases, such as privacy preservation, multi-institutional privacy, data security, multi-institutional data, healthcare, confidentiality, and institutional privacy safeguards. A backward and forward citation tracking approach also is employed to identify seminal papers and relevant references. This search approach aims to encompass a broad range of sources and perspectives to provide a comprehensive overview of privacy preservation practices in multi-institutional contexts. By specifying the publication year range from 2016 to 2023, the search retrieves relevant and up-to-date literature on this important topic.</p></sec><sec id="s3_1_2"><title>3.1.2. Inclusion Criteria</title><p>The Selected papers primarily focus on methods, techniques, or frameworks related to the preservation of privacy within the context of multi-institutional data, ensuring that the research is directly aligned with the survey’s subject matter. Articles published between 2016 and 2023 included allowing for an up-to-date understanding of the evolving landscape of privacy preservation in multi-institutional data. The survey considers research articles, peer-reviewed conference papers, books, and relevant reports with rigorous academic or professional analysis.</p></sec><sec id="s3_1_3"><title>3.1.3. Exclusion Criteria</title><p>Studies published before the year 2016 will be excluded from the survey. This criterion ensures that focus on the most recent research and developments in privacy preservation, considering that older publications may not reflect current practices and technologies. Exclude articles that do not have direct involvement or expertise in multi-institutional data management, as their input may not contribute to meaningful insights. Exclude papers with incomplete or missing answers to key questions, as these can introduce bias and reduce the reliability of the findings.</p></sec></sec><sec id="s3_2"><title>3.2. Analysis Based on the Performance Metrics</title><p>The analysis is conducted using parameter measurements that have been employed by numerous academics to demonstrate the model’s efficacy. A number of metrics are observed, including MAE, RMSE, information loss, time, computation cost, communication cost, search time in seconds, search time per identifier in seconds, communication overhead, information loss, classification accuracy, attack range, and cost function NPCR-number of pixel change rate, UACI-unified average changing in tensity tests, minimum std, average overhead bandwidth. Observations reveal that the metrics time, computation cost, and range of attacks as shown in <xref ref-type="table" rid="table1">Table 1</xref> are frequently used by researchers. The measures that the reviewers used are explained in <xref ref-type="fig" rid="fig2">Figure 2</xref>.</p></sec><sec id="s3_3"><title>3.3. Analysis Based on Publication Year</title><p>Depending on the journal’s publication year, the analysis is done in this section. <xref ref-type="table" rid="table2">Table 2</xref> presents a review of the output from 2013 to 2021. <xref ref-type="fig" rid="fig3">Figure 3</xref> gives detailed information and represents the majority of literary works from the 2020s that are pertinent to the analysis of privacy preservation techniques.</p></sec><sec id="s3_4"><title>3.4. Analysis Based on Journals</title><p>This section evaluates the articles that have been published in the aforementioned publications and are relevant to privacy protection techniques. <xref ref-type="table" rid="table3">Table 3</xref>, shows analysis concerning of journals and their affiliated papers. Journals from</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Analysis concerning metrics</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Metrics</th><th align="center" valign="middle" >Papers</th></tr></thead><tr><td align="center" valign="middle" >MAE</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref6">6</xref>]</td></tr><tr><td align="center" valign="middle" >RMSE</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref6">6</xref>]</td></tr><tr><td align="center" valign="middle" >Information loss</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>]</td></tr><tr><td align="center" valign="middle" >Time</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref11">11</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref12">12</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref15">15</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref22">22</xref>]</td></tr><tr><td align="center" valign="middle" >Computation cost</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref14">14</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref16">16</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref20">20</xref>]</td></tr><tr><td align="center" valign="middle" >Communication cost</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>]</td></tr><tr><td align="center" valign="middle" >Search time in seconds</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref23">23</xref>]</td></tr><tr><td align="center" valign="middle" >Search time per identifiers in seconds</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref10">10</xref>]</td></tr><tr><td align="center" valign="middle" >Communication overhead</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref13">13</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref16">16</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref20">20</xref>]</td></tr><tr><td align="center" valign="middle" >Information loss</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>]</td></tr><tr><td align="center" valign="middle" >Classification accuracy</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>]</td></tr><tr><td align="center" valign="middle" >Range of attacks</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref18">18</xref>]</td></tr><tr><td align="center" valign="middle" >Cost function</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref18">18</xref>]</td></tr><tr><td align="center" valign="middle" >NPCR-number of pixel change rate</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref19">19</xref>]</td></tr><tr><td align="center" valign="middle" >UACI-unified average changing in tensity tests</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref19">19</xref>]</td></tr><tr><td align="center" valign="middle" >Minimum std</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>]</td></tr><tr><td align="center" valign="middle" >Average overhead bandwidth</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref24">24</xref>]</td></tr></tbody></table></table-wrap><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Analysis concerning publication year</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Year of Publications</th><th align="center" valign="middle" >Papers</th></tr></thead><tr><td align="center" valign="middle" >2021</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref6">6</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>]</td></tr><tr><td align="center" valign="middle" >2020</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref14">14</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref15">15</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref16">16</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref19">19</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref23">23</xref>]</td></tr><tr><td align="center" valign="middle" >2019</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref13">13</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref18">18</xref>]</td></tr><tr><td align="center" valign="middle" >2018</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref12">12</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref20">20</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref22">22</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref24">24</xref>]</td></tr><tr><td align="center" valign="middle" >2016</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref4">4</xref>]</td></tr><tr><td align="center" valign="middle" >2013</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref11">11</xref>]</td></tr></tbody></table></table-wrap><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title> Analysis concerning journals</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Journal Names</th><th align="center" valign="middle" >Papers</th></tr></thead><tr><td align="center" valign="middle" >IEEE</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref13">13</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref14">14</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref15">15</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref16">16</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref20">20</xref>]</td></tr><tr><td align="center" valign="middle" >Wiley</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref6">6</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref7">7</xref>]</td></tr><tr><td align="center" valign="middle" >Elementary Education</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>]</td></tr><tr><td align="center" valign="middle" >Elsevier</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref11">11</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref12">12</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref18">18</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref19">19</xref>]</td></tr><tr><td align="center" valign="middle" >International Journal</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>]</td></tr><tr><td align="center" valign="middle" >Arxiv</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>]</td></tr><tr><td align="center" valign="middle" >Future Generation Computer Systems</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref22">22</xref>]</td></tr><tr><td align="center" valign="middle" >International Journal of Advanced Computer Science</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref23">23</xref>]</td></tr><tr><td align="center" valign="middle" >In Proceedings of the 2018 Workshop</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref24">24</xref>]</td></tr><tr><td align="center" valign="middle" >Springer</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref1">1</xref>]</td></tr><tr><td align="center" valign="middle" >Berkeley Technology Law Journal</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref4">4</xref>]</td></tr></tbody></table></table-wrap><p>the IEEE, Wiley, Springer, elementary education, Elsevier, international journals, arxiv, and next generation computer system can all be used to get publications about privacy preservation approaches. The vast majority of publications from Elsevier and IEEE have extensive analyses and reviews, which are interpreted in <xref ref-type="fig" rid="fig4">Figure 4</xref>.</p></sec><sec id="s3_5"><title>3.5. Analysis Based on Dataset</title><p>Based on the datasets used by various researchers, <xref ref-type="table" rid="table4">Table 4</xref> offers an interpretation of the analysis. Several datasets, including Apache Spark, Real Word, Mockaroo, EHR, Eron Email, Medical Record, Standard Adult, and others, are assessed. <xref ref-type="fig" rid="fig5">Figure 5</xref> provides a description of the datasets used by the reviewers.</p></sec></sec><sec id="s4"><title>4. Research Gaps and Challenges</title><p>➢ Privacy protection is of significant consideration in big data, and hence demanding resilient strategies is necessary to safeguard the customers’ privacy.</p><p>➢ Apart from privacy concerns, efficiency issues, such as the communication overhead and computational cost among the providers and the servers, must also be considered.</p><p>➢ Achieving a better level of security while maintaining reasonable computational complexity remains an onerous task with the transmission of medical images in real-time applications.</p><p>➢ The expanded dimensions of e-health data are one of the major issues with demanding privacy. The complexity of traditional data encryption techniques prevents them from being used in modern applications or for real-time image transmission.</p><p>➢ Satisfying the demands of an image encryption model in terms of greater security to reduce processing complexity is a difficult task.</p><p>➢ Sensitive data in the healthcare industry must be kept private in order to avoid interfering with patient privacy or the activities of healthcare associations.</p><p>➢ The preservation of privacy over incremental data sets is still tricky in the cloud framework, as most data sets are large in volume and scattered across multiple storage nodes.</p><table-wrap id="table4" ><label><xref ref-type="table" rid="table4">Table 4</xref></label><caption><title> Analysis concerning dataset</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Dataset</th><th align="center" valign="middle" >Research Papers</th></tr></thead><tr><td align="center" valign="middle" >Apache Spark</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref5">5</xref>]</td></tr><tr><td align="center" valign="middle" >Real world Data Set</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref6">6</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref11">11</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref20">20</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref24">24</xref>]</td></tr><tr><td align="center" valign="middle" >Mockaroo Database</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref8">8</xref>]</td></tr><tr><td align="center" valign="middle" >EHR Database</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref12">12</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref13">13</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref14">14</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref15">15</xref>]</td></tr><tr><td align="center" valign="middle" >Eron Email Dataset</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref10">10</xref>]</td></tr><tr><td align="center" valign="middle" >Medical Record Datasets</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref16">16</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref22">22</xref>]</td></tr><tr><td align="center" valign="middle" >Standard Adult Dataset</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref17">17</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref19">19</xref>]</td></tr><tr><td align="center" valign="middle" >Autism-Adolescent Dataset</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref18">18</xref>]</td></tr><tr><td align="center" valign="middle" >Synthetic Dataset</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref20">20</xref>]</td></tr><tr><td align="center" valign="middle" >UCI ML Data</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref21">21</xref>]</td></tr><tr><td align="center" valign="middle" >Big Dataset</td><td align="center" valign="middle" >[<xref ref-type="bibr" rid="scirp.128204-ref23">23</xref>]</td></tr></tbody></table></table-wrap><p>➢ The privacy preservation using query set consumes more time and more resources to carry out the computation and hence cannot be used in real-time applications.</p><p>➢ In distributed data mining, certain sum, scalar product, secure set union, and set intersection are a few of the operations regarded as the fundamental operations. Due to the increasing computational complexity, they are unable to offer enough data utility and are unreasonable for privacy-preserving data mining (PPDM).</p><p>➢ Most conventional authentication techniques) could not provide enhanced security and performance characteristics to prevent potential attacks.</p><p>➢ An authentication protocol based on ECC developed showed formal and informal security studies to confirm security efficiency. However, the developed strategy cannot avoid the attacks, such as stolen-verifier and offline password-guessing attack.</p><table-wrap id="table5" ><label><xref ref-type="table" rid="table5">Table 5</xref></label><caption><title> Analysis of privacy preservation in multi-institutional data</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Paper No</th><th align="center" valign="middle" >Data Acquisition</th><th align="center" valign="middle" >Applications</th><th align="center" valign="middle" >Parameter</th><th align="center" valign="middle" >Advantage</th><th align="center" valign="middle" >Disadvantage</th><th align="center" valign="middle" >Solution</th></tr></thead><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >EHR</td><td align="center" valign="middle" >IOT</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Blockchain-based EHR sharing gives patients greater control over their health records, allowing them to manage access and permissions securely.</td><td align="center" valign="middle" >Implementing blockchain-based EHR systems requires a good understanding of blockchain technology and may be complex for healthcare organizations with limited expertise.</td><td align="center" valign="middle" >Creating user-friendly interfaces and tools for patients and healthcare providers can simplify the use of blockchain-based EHR systems.</td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >Real Time</td><td align="center" valign="middle" >IOT Based Health Care</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >helps in understanding user attitudes towards IoT-based healthcare and the impact of privacy and security on trust.</td><td align="center" valign="middle" >Privacy and security concerns associated with IoT can affect users’ trust in using IoT devices.</td><td align="center" valign="middle" >Manufacturers should implement transparent privacy measures in IoT devices and educate users about them to build trust.</td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >Real-Time</td><td align="center" valign="middle" >Consumer IoT</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >The research highlights IoT device privacy and security risks, encouraging a proactive response.</td><td align="center" valign="middle" >As IoT devices become more pervasive, the risk of privacy and security breaches may rise.</td><td align="center" valign="middle" >Continuous monitoring and regular updates of IoT devices can help mitigate evolving security threats.</td></tr><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >Real Time</td><td align="center" valign="middle" >IOT</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >The challenges of health data anonymization and the potential for unauthorized access to IoT data.</td><td align="center" valign="middle" >IoT devices are vulnerable due to factors such as inexperienced manufacturers and inadequate security measures.</td><td align="center" valign="middle" >Requires data security training and must embed strong encryption and security features into IoT devices.</td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >Eron Email Dataset</td><td align="center" valign="middle" >Healthcare Big Data</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >The Spark system efficiently manages large healthcare datasets, improving data processing speed and scalability.</td><td align="center" valign="middle" >Implementing Apache Spark and advanced anonymization techniques may be technically challenging for some organizations.</td><td align="center" valign="middle" >Invest in optimizing the computational resources required for Spark and anonymization processes.</td></tr><tr><td align="center" valign="middle" >6</td><td align="center" valign="middle" >Real Parking Dataset</td><td align="center" valign="middle" >Smart Parking System</td><td align="center" valign="middle" >Time stamp</td><td align="center" valign="middle" >K-anonymity and differential privacy techniques protect user privacy in parking recommender systems.</td><td align="center" valign="middle" >Anonymization techniques can impact the quality and utility of data.</td><td align="center" valign="middle" >Research and develop methods to balance privacy protection with data utility.</td></tr><tr><td align="center" valign="middle" >7</td><td align="center" valign="middle" >TELE ECG Database</td><td align="center" valign="middle" >Cloud-IoT</td><td align="center" valign="middle" >Minimum number of proxy servers in a homomorphic encryption.</td><td align="center" valign="middle" >The tuple partitioning approach effectively preserves privacy over incremental databases on the cloud.</td><td align="center" valign="middle" >Implementing tuple partitioning and packetization may be complex.</td><td align="center" valign="middle" >Explore simplified implementations of privacy-preserving strategies for broader use.</td></tr><tr><td align="center" valign="middle" >8</td><td align="center" valign="middle" >Mockaroo Database</td><td align="center" valign="middle" >Data Mining</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >The Efficient Anonymous Algorithm (NEAA) provides enhanced privacy for private data.</td><td align="center" valign="middle" >Implementing privacy algorithms may require technical expertise.</td><td align="center" valign="middle" >Continuously refine NEAA and similar algorithms to improve privacy protection.</td></tr><tr><td align="center" valign="middle" >9</td><td align="center" valign="middle" >Electronic Health Record (EHR)</td><td align="center" valign="middle" >Healthcare</td><td align="center" valign="middle" >System parameters</td><td align="center" valign="middle" >A prototype demonstrates the practical applicability of the established protocol.</td><td align="center" valign="middle" >Implementing mutual anonymous authentication protocols may require technical expertise.</td><td align="center" valign="middle" >Offer training and resources to facilitate the implementation of authentication protocols.</td></tr><tr><td align="center" valign="middle" >10</td><td align="center" valign="middle" >Eron Email Dataset</td><td align="center" valign="middle" >Blockchain</td><td align="center" valign="middle" >Security parameter</td><td align="center" valign="middle" >The proposed bloom filter-enabled multi-keyword search protocol emphasizes privacy preservation. By reducing the exposure of intermediate results, it minimizes the risk of service peers or other entities accessing sensitive information associated with the encrypted keywords.</td><td align="center" valign="middle" >While the protocol shows promise in a simulated environment, its real-world applicability and robustness may need further validation and testing in actual blockchain systems, which could present unforeseen challenges.</td><td align="center" valign="middle" >To address the complexity issue, clear and comprehensive documentation, along with training resources, should be made available to facilitate the implementation and operation of the protocol.</td></tr><tr><td align="center" valign="middle" >11</td><td align="center" valign="middle" >Adult dataset from UCI Machine Learning Repository</td><td align="center" valign="middle" >Cloud Computing</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Scalability is a key advantage, particularly in cloud environments where data can be distributed across multiple storage nodes.</td><td align="center" valign="middle" >While the approach offers scalability and efficiency, it may also introduce complexity due to the need for indexing and specialized algorithms.</td><td align="center" valign="middle" >user-friendly tools and interfaces that allow healthcare organizations to easily implement the proposed approach without requiring extensive technical expertise.</td></tr><tr><td align="center" valign="middle" >12</td><td align="center" valign="middle" >EHR</td><td align="center" valign="middle" >Blockchain</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Proxy re-ciphering and traditional methods enhance data security for sensitive information.</td><td align="center" valign="middle" >Some encryption methods may demand significant computational resources.</td><td align="center" valign="middle" >Invest in optimizing resource-intensive encryption techniques.</td></tr><tr><td align="center" valign="middle" >13</td><td align="center" valign="middle" >EHR</td><td align="center" valign="middle" >Blockchain</td><td align="center" valign="middle" >System parameters</td><td align="center" valign="middle" >The protocol based on keyword search enhances data security and privacy in the EHR system.</td><td align="center" valign="middle" >Certain encryption techniques may require significant computational resources.</td><td align="center" valign="middle" >Explore resource-efficient encryption methods.</td></tr><tr><td align="center" valign="middle" >14</td><td align="center" valign="middle" >EHR</td><td align="center" valign="middle" >Blockchain</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >The paper addresses the issue of cross-blockchain-based EHR storage strategies, which can improve interoperability between healthcare institutions. This ensures that patients can access their EHR data efficiently, even when visiting different hospitals.</td><td align="center" valign="middle" >Implementing cross-blockchain solutions based on Polka Dot chain technology and RaaS may involve technical complexities. Healthcare organizations may require specialized expertise for deployment.</td><td align="center" valign="middle" >Implementing cross-blockchain solutions based on Polka Dot chain technology and RaaS may involve technical complexities. Healthcare organizations may require specialized expertise for deployment.</td></tr><tr><td align="center" valign="middle" >15</td><td align="center" valign="middle" >EHR</td><td align="center" valign="middle" >Blockchain</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >The blockchain-based model proposed in this paper provides patients with control over their health records. Patients can monitor data access and ensure data integrity, enhancing patient empowerment.</td><td align="center" valign="middle" >Integrating existing healthcare systems with blockchain technology can be challenging, particularly in large, established healthcare organizations.</td><td align="center" valign="middle" >Designing user-friendly interfaces and tools for patients and healthcare providers can simplify the use of blockchain-based EHR systems.</td></tr><tr><td align="center" valign="middle" >16</td><td align="center" valign="middle" >International Classification of Diseases (ICD)</td><td align="center" valign="middle" >E-Health Care System</td><td align="center" valign="middle" >Minimum information loss, total number of clusters, cluster centroid</td><td align="center" valign="middle" >The development of privacy-preserving sub-protocols demonstrates a systematic approach to privacy protection, making it easier to adapt the scheme to various clinical scenarios.</td><td align="center" valign="middle" >Privacy-preserving protocols, while effective, can be complex to implement and require a thorough understanding of cryptographic techniques. This complexity may hinder adoption by healthcare professionals.</td><td align="center" valign="middle" >Continued research into optimizing k-anonymization techniques can reduce resource requirements while maintaining strong privacy guarantees.</td></tr><tr><td align="center" valign="middle" >17</td><td align="center" valign="middle" >Adult Dataset</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Awareness probability</td><td align="center" valign="middle" >The proposed framework prioritizes patient data privacy by utilizing local differential privacy, ensuring that sensitive patient information remains secure during the collaborative training process.</td><td align="center" valign="middle" >Implementing a fog-based federated framework with differential privacy can be technically challenging, requiring expertise in both healthcare and privacy-preserving machine learning.</td><td align="center" valign="middle" >Continuous research and development efforts can focus on optimizing the communication protocols and algorithms used in federated learning to reduce overhead and improve efficiency.</td></tr><tr><td align="center" valign="middle" >18</td><td align="center" valign="middle" >Autism-Adolescent Dataset</td><td align="center" valign="middle" >Health Care</td><td align="center" valign="middle" >Awareness probability</td><td align="center" valign="middle" >The blockchain-oriented privacy-preserving EHR sharing protocol ensures that only authorized data requestors can access sensitive EHRs, protecting patient privacy effectively.</td><td align="center" valign="middle" >The cryptographic operations involved in the protocol may introduce computational overhead, potentially affecting system performance. This complexity can be a limitation in resource-constrained environments.</td><td align="center" valign="middle" >Researchers can focus on optimizing the cryptographic primitives used in the protocol to reduce computational overhead and enhance efficiency. This can make the solution more practical for real-world implementation.</td></tr><tr><td align="center" valign="middle" >19</td><td align="center" valign="middle" >WCE Dataset</td><td align="center" valign="middle" >IoT-E Health Care</td><td align="center" valign="middle" >Trajectory</td><td align="center" valign="middle" >The model employs chaos-based privacy-preserving encryption to protect patients’ privacy effectively. It ensures that sensitive patient images remain confidential and secure.</td><td align="center" valign="middle" >Implementing advanced encryption techniques can be computationally intensive. This may pose challenges in resource-constrained healthcare environments.</td><td align="center" valign="middle" >To address computational intensity, research can focus on optimizing encryption algorithms for efficiency, making them more suitable for healthcare applications.</td></tr><tr><td align="center" valign="middle" >20</td><td align="center" valign="middle" >Acute Inflammations Dataset, Real Time Dataset</td><td align="center" valign="middle" >E-Healthcare</td><td align="center" valign="middle" >Bilinear pairing parameters</td><td align="center" valign="middle" >The usage of data anonymity enhances privacy by limiting access to factual data.</td><td align="center" valign="middle" >Implementing data anonymity and semantic strategies may be complex.</td><td align="center" valign="middle" >Offer training and guidance on the implementation of data anonymity and semantic strategies.</td></tr><tr><td align="center" valign="middle" >21</td><td align="center" valign="middle" >PABIDOT Perturbs</td><td align="center" valign="middle" >Health Care</td><td align="center" valign="middle" >perturbation parameters</td><td align="center" valign="middle" >SEAL algorithm ensures data privacy in significant data distribution and evaluation strategies while maintaining efficiency, scalability, and higher attack resistance.</td><td align="center" valign="middle" >Implementing advanced privacy-preserving algorithms may require expertise and careful configuration.</td><td align="center" valign="middle" >Developing user-friendly interfaces for complex algorithms can facilitate adoption in healthcare settings.</td></tr><tr><td align="center" valign="middle" >22</td><td align="center" valign="middle" >Data Access Control System</td><td align="center" valign="middle" >Health Care</td><td align="center" valign="middle" >Security parameter</td><td align="center" valign="middle" >The fine-grained update strategy ensures data integrity and privacy preservation while minimizing communication and computation costs.</td><td align="center" valign="middle" >Implementing fine-grained updates may introduce complexity in data management.</td><td align="center" valign="middle" >Developing clear implementation guidelines can assist healthcare organizations in effectively adopting such strategies.</td></tr><tr><td align="center" valign="middle" >23</td><td align="center" valign="middle" >Adult’s Dataset and Bank Marketing Dataset</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >The clustering-based Privacy Preservation strategy in big data led to successful data reconstruction, ensuring privacy and utility.</td><td align="center" valign="middle" >Data clustering and generalization may lead to some loss of data granularity.</td><td align="center" valign="middle" >Research can focus on optimizing data generalization techniques to minimize data loss.</td></tr><tr><td align="center" valign="middle" >24</td><td align="center" valign="middle" >Real Time</td><td align="center" valign="middle" >Smart Home IoT</td><td align="center" valign="middle" >Independent link padding (ILP) or dependent link padding (DLP), control padding and fragmentation parameters.</td><td align="center" valign="middle" >privacy risks associated with IoT applications in smart homes, shedding light on potential threats and concerns.</td><td align="center" valign="middle" >Users may not be able to block outgoing traffic from IoT devices, posing privacy risk</td><td align="center" valign="middle" >IoT devices should provide users with robust privacy settings to control outgoing traffic effectively.</td></tr><tr><td align="center" valign="middle" >25</td><td align="center" valign="middle" >Block Chain</td><td align="center" valign="middle" >Educational Institution</td><td align="center" valign="middle" >Local parameters</td><td align="center" valign="middle" >The system prioritizes privacy by allowing users to have control over their credentials. This is crucial for protecting sensitive student information.</td><td align="center" valign="middle" >Integrating the blockchain solution with existing education systems and institutions may pose challenges, as standardization and compatibility issues can arise.</td><td align="center" valign="middle" >Establish interoperability standards to facilitate the integration of blockchain solutions with existing education systems seamlessly.</td></tr><tr><td align="center" valign="middle" >26</td><td align="center" valign="middle" >COVID-19 CT Datasets: COVID-19-CTSeg, Mos MedData Dataset</td><td align="center" valign="middle" >IoMT</td><td align="center" valign="middle" >Model weights</td><td align="center" valign="middle" >Collaborative training using data from multiple institutions enhances the robustness and generalizability of deep learning models in medical imaging, making them more effective in diverse clinical settings.</td><td align="center" valign="middle" >Collaborative learning involves data exchange among institutions, which can result in increased communication overhead and potential latency issues.</td><td align="center" valign="middle" >Developing efficient communication protocols and data compression techniques can reduce communication overhead and address latency concerns.</td></tr><tr><td align="center" valign="middle" >27</td><td align="center" valign="middle" >National Lung Screening Trial, Medical Imaging Data Resource Center, BRATS, and Alzheimer Disease Neuroimaging Initiative,</td><td align="center" valign="middle" >Medical Diagnosis</td><td align="center" valign="middle" >Uantitative parameters,</td><td align="center" valign="middle" >Federated Learning (FL) allows the development of deep learning models across multiple centers without direct data sharing, addressing privacy concerns and legal/ethical issues associated with centralized datasets.</td><td align="center" valign="middle" >Implementing FL can be complex, requiring coordination among multiple centers and setting up secure communication channels. It may also demand substantial computational resources.</td><td align="center" valign="middle" >Establishing common data acquisition and reconstruction protocols across centers can mitigate data heterogeneity and enhance model generalization.</td></tr><tr><td align="center" valign="middle" >28</td><td align="center" valign="middle" >Real Time</td><td align="center" valign="middle" >Health Care</td><td align="center" valign="middle" >Local parameters</td><td align="center" valign="middle" >FL models can be trained on diverse datasets from different clinical centers, leading to more generalizable models that perform well across a variety of imaging protocols and patient populations.</td><td align="center" valign="middle" >FL involves iterative communication between centers and a central server, which can lead to increased communication overhead and potentially slower convergence compared to centralized training.</td><td align="center" valign="middle" >Developing efficient communication protocols and strategies for FL can reduce communication overhead and speed up convergence.</td></tr><tr><td align="center" valign="middle" >29</td><td align="center" valign="middle" >Real Time</td><td align="center" valign="middle" >Health Care</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Synthetic datasets aim to mimic real data, ensuring that researchers can still derive valuable insights and conduct meaningful analyses without access to the original, sensitive data.</td><td align="center" valign="middle" >The synthetic data may not capture fine-grained details present in the original data, potentially limiting certain types of analyses.</td><td align="center" valign="middle" >Fine-tune synthetic data generation models to specific research objectives and datasets to achieve better mimicry.</td></tr><tr><td align="center" valign="middle" >30</td><td align="center" valign="middle" >Real Time</td><td align="center" valign="middle" >IoHT</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Local deep-learning models are trained collaboratively, reducing the need for centralized data collection, which can be time-consuming and resource-intensive.</td><td align="center" valign="middle" >Implementing and managing a fog-based federated framework can be complex, requiring specialized expertise and infrastructure.</td><td align="center" valign="middle" >Continuous research and development in privacy-preserving techniques can help bolster security in federated learning, ensuring patient data remains confidential.</td></tr></tbody></table></table-wrap><p>➢ An anonymization model, known as optimal balancing scheduling, based on Map Reduce strategy introduced to overcome scalability. The method assists in enhancing re-anonymization and better handling the problems associated with data locality. However, the model failed to deal with the security issue during privacy preservation.</p><p>The analysis based on Data Acquisition, Applications, Parameters, Advantages, Disadvantages, and Solution for the privacy preservation of multi-institutional data are tabulated in <xref ref-type="table" rid="table5">Table 5</xref>.</p>Discussion on Laws, Regulations and Policies<p>In the realm of healthcare data management and privacy protection, adherence to regulations and standards is paramount to safeguard patient information. HIPAA establishes a comprehensive framework for protecting patient health information through security measures, privacy policies, and data breach protocols. GDPR imposes strict privacy requirements on handling personal health data, emphasizing transparency and individual rights [<xref ref-type="bibr" rid="scirp.128204-ref27">27</xref>] . Challenges and Limitations of Cross-Border Data Sharing. Varying global privacy laws complicate harmonizing standards for international data sharing. Stringent data sovereignty laws mandate local storage, hindering seamless cross-border data transfer. Collaboration among healthcare entities and policymakers fosters standardized cross-border data-sharing protocols [<xref ref-type="bibr" rid="scirp.128204-ref26">26</xref>] [<xref ref-type="bibr" rid="scirp.128204-ref29">29</xref>] .</p></sec><sec id="s5"><title>5. Conclusion</title><p>In this research, various kinds of literature to clarify the issues associated with the privacy preservation of e-Health data have been investigated. Multiple techniques such as encryption, sanitation, and anonymization are evaluated to enhance the privacy of the data transmitted through the IoT platform. A brief study of the anonymization strategies, optimization algorithms, and blockchain-based strategies used to solve the problems in the privacy preservation of the sensitive medical data of patients has been done. The challenges associated with existing methods are also analyzed in detail to solve the problems with the development of the proposed method. Briefly, the research gaps and issues related to the existing techniques are analyzed and presented in this section. Hence, protecting patient data privacy is a prerequisite for data management and sharing to safeguard the patients whose medical data is shared in clinical research while making information available for future research.</p></sec><sec id="s6"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s7"><title>Cite this paper</title><p>Patel, S., Patel, R., Akbari, A. and Mukkala, S.R. (2023) An Extensive Study and Review of Privacy Preservation Models for the Multi-Institutional Data. Journal of Information Security, 14, 343-365. https://doi.org/10.4236/jis.2023.144020</p></sec></body><back><ref-list><title>References</title><ref id="scirp.128204-ref1"><label>1</label><mixed-citation publication-type="book" xlink:type="simple">Sébastien, Z., Crettaz, C., Kim, E., Skarmeta, A., Bernabe, J.B., Trapero, R. and Bianchi, S. (2019) Privacy and Security Threats on the Internet of Things. In: Ziegler, S., Ed., Internet of Things Security and Data Protection, Springer, Berlin, 9-43. https://doi.org/10.1007/978-3-030-04984-3_2</mixed-citation></ref><ref id="scirp.128204-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Naser, A.M., Farooque, M.M.J. and Khashab, B.L. (2019) The Effect of Security, Privacy, Familiarity, and Trust on Users’ Attitudes toward the Use of the IoT-Based Healthcare: The Mediation Role of Risk Perception. IEEE Access, 7, 111341-111354. https://doi.org/10.1109/ACCESS.2019.2904006</mixed-citation></ref><ref id="scirp.128204-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Tejasvi, A., Chamola, V., Sikdar, B. and Choo, K.-K.R. (2020) Consumer IoT: Security Vulnerability Case Studies and Solutions. IEEE Consumer Electronics Magazine, 9, 17-25. https://doi.org/10.1109/MCE.2019.2953740</mixed-citation></ref><ref id="scirp.128204-ref4"><label>4</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Swaroop</surname><given-names> P. </given-names></name>,<etal>et al</etal>. (<year>2016</year>)<article-title>Internet of Things: Underlying Technologies, Interoperability, and Threats to Privacy and Security</article-title><source> Berkeley Technology Law Journal</source><volume> 31</volume>,<fpage> 997</fpage>-<lpage>1022</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.128204-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Suneetha, V., Suresh, S. and Jhananie, V. (2020) A Novel Framework Using Apache Spark for Privacy Preservation of Healthcare Big Data. 2020 2nd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA), Bangalore, 5-7 March 2020, 743-749. https://doi.org/10.1109/ICIMIA48430.2020.9074867</mixed-citation></ref><ref id="scirp.128204-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Saleem, Y., Rehmani, M.H., Crespi, N. and Minerva, R. (2021) Parking Recommender System Privacy Preservation through Anonymization and Differential Privacy. Engineering Reports, 3, e12297. https://doi.org/10.1002/eng2.12297</mixed-citation></ref><ref id="scirp.128204-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Shruthi, R. and Govindarasu, M. (2020) An Efficient Framework for Privacy-Preserving Computations on Encrypted IoT Data. IEEE Internet of Things Journal, 7, 8700-8708. https://doi.org/10.1109/JIOT.2020.2998109</mixed-citation></ref><ref id="scirp.128204-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Dhaval, J. and Panchal, R. (2021) A Novel Anonymity Algorithm for Privacy Preservation. Elementary Education Online, 20, 2402-2402.</mixed-citation></ref><ref id="scirp.128204-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Qi, F., He, D., Wang, H., Zhou, L. and Choo, K.-K.R. (2019) Lightweight Collaborative Authentication with Key Protection for Smart Electronic Health Record System. IEEE Sensors Journal, 20, 2181-2196. https://doi.org/10.1109/JSEN.2019.2949717</mixed-citation></ref><ref id="scirp.128204-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Shan, J., Cao, J., McCann, J.A., Yang, Y., Liu, Y., Wang, X. and Deng, Y. (2019) Privacy-Preserving and Efficient Multi-Keyword Search over Encrypted Data on Blockchain. 2019 IEEE International Conference on Blockchain, Atlanta, 14-17 July 2019, 405-410.</mixed-citation></ref><ref id="scirp.128204-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Xuyun, Z., Liu, C., Nepal, S. and Chen, J. (2013) An Efficient Quasi-Identifier Index Based Approach for Privacy Preservation over Incremental Data Sets on Cloud. Journal of Computer and System Sciences, 79, 542-555. https://doi.org/10.1016/j.jcss.2012.11.008</mixed-citation></ref><ref id="scirp.128204-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Dagher, G.G., Mohler, J., Milojkovic, M. and Marella, P.B. (2018) Ancile: Privacy-Preserving Framework for Access Control and Interoperability of Electronic Health Records Using Blockchain Technology. Sustainable Cities and Society, 39, 283-297. https://doi.org/10.1016/j.scs.2018.02.014</mixed-citation></ref><ref id="scirp.128204-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Yong, W., Zhang, A., Zhang, P. and Wang, H. (2019) Cloud-Assisted EHR Sharing with Security and Privacy Preservation via Consortium Blockchain. IEEE Access, 7, 136704-136719. https://doi.org/10.1109/ACCESS.2019.2943153</mixed-citation></ref><ref id="scirp.128204-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Sheng, C., Wang, J., Du, X., Zhang, X. and Qin, X. (2020) CEPS: A Cross-Blockchain Based Electronic Health Records Privacy-Preserving Scheme. ICC 2020-2020 IEEE International Conference on Communications (ICC), Dublin, 7-11 June 2020, 1-6.</mixed-citation></ref><ref id="scirp.128204-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Nuetey, N.R., Yue, L., Agdedanu, P.R. and Adjeisah, M. (2019) Privacy Module for Distributed Electronic Health Records (EHRs) Using the Blockchain. 2019 IEEE 4th International Conference on Big Data Analytics (ICBDA), Suzhou, 15-18 March 2019, 369-374.</mixed-citation></ref><ref id="scirp.128204-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, M.W., Chen, Y. and Susilo, W. (2020) PPO-CPQ: A Privacy-Preserving Optimization of Clinical Pathway Query for e-Healthcare Systems. IEEE Internet of Things Journal, 7, 10660-10672. https://doi.org/10.1109/JIOT.2020.3007518</mixed-citation></ref><ref id="scirp.128204-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Suman, M. and Goswami, P. (2021) A Technique for Securing Big Data Using k-Anonymization with a Hybrid Optimization Algorithm. International Journal of Operations Research and Information Systems (IJORIS), 12, 1-21. https://doi.org/10.4018/IJORIS.20211001.oa3</mixed-citation></ref><ref id="scirp.128204-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Jyothi, M. and Rao, C.S. (2019) Privacy Preservation of Data Using Crow Search with Adaptive Awareness Probability. Journal of Information Security and Applications, 44, 157-169. https://doi.org/10.1016/j.jisa.2018.12.005</mixed-citation></ref><ref id="scirp.128204-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Rafik, H., Yan, Z., Muhammad, K., Bellavista, P. and Titouna, F. (2020) A Privacy-Preserving Cryptosystem for IoT E-Healthcare. Information Sciences, 527, 493-510. https://doi.org/10.1016/j.ins.2019.01.070</mixed-citation></ref><ref id="scirp.128204-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Xue, Y., Lu, R., Shao, J., Tang, X. and Yang, H. (2018) An Efficient and Privacy-Preserving Disease Risk Prediction Scheme for e-Healthcare. IEEE Internet of Things Journal, 6, 3284-3297. https://doi.org/10.1109/JIOT.2018.2882224</mixed-citation></ref><ref id="scirp.128204-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Chamikara, M.A.P., Bertok, P., Liu, D., Camtepe, S. and Khalil, I. (2020) Efficient Privacy Preservation of Big Data for Accurate Data Mining. Information Sciences, 527, 420-443. https://doi.org/10.1016/j.ins.2019.05.053</mixed-citation></ref><ref id="scirp.128204-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Chen, Z., Zhang, F., Zhang, P., Liu, J.K., Huang, J., Zhao, H. and Shen, J. (2018) Verifiable Keyword Search for Secure Big Data-Based Mobile Healthcare Networks with Fine-Grained Authorization Control. Future Generation Computer Systems, 87, 712-724. https://doi.org/10.1016/j.future.2017.10.022</mixed-citation></ref><ref id="scirp.128204-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Saira, K., Iqbal, K., Faizullah, S., Fahad, M., Ali, J. and Ahmed, W. (2020) Clustering Based Privacy Preserving of Big Data Using Fuzzification and Anonymization Operation.</mixed-citation></ref><ref id="scirp.128204-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">Trisha, D., Apthorpe, N. and Feamster, N. (2018) A Developer-Friendly Library for Smart Home Iot Privacy-Preserving Traffic Obfuscation. Proceedings of the 2018 Workshop on IoT Security and Privacy, Budapest, 20 August 2018, 43-48.</mixed-citation></ref><ref id="scirp.128204-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Fang, R., Pouyanfar, S., Yang, Y., Chen, S.C. and Iyengar, S.S. (2016). Computational Health Informatics in the Big Data Age: A Survey. ACM Computing Surveys (CSUR), 49, 1-36.</mixed-citation></ref><ref id="scirp.128204-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">Abdel-Basset, M., Hawash, H. and Abouhawwash, M. (2022) Collaborative Screening of COVID-19-Like Disease from Multi-Institutional Radiographs: A Federated Learning Approach. Mathematics, 10, 4766.</mixed-citation></ref><ref id="scirp.128204-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple">Gupta, S., Kumar, S., Chang, K., Lu, C., Singh, P. and Kalpathy-Cramer, J. (2023) Collaborative Privacy-Preserving Approaches for Distributed Deep Learning Using Multi-Institutional Data. RadioGraphics, 43, e220107. https://doi.org/10.1148/rg.220107</mixed-citation></ref><ref id="scirp.128204-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple">Shiri, I., Vafaei Sadr, A., Akhavan, A., Salimi, Y., Sanaat, A., Amini, M. and Zaidi, H. (2023) Decentralized Collaborative Multi-Institutional PET Attenuation and Scatter Correction Using Federated Deep Learning. European Journal of Nuclear Medicine and Molecular Imaging, 50, 1034-1050. https://doi.org/10.1007/s00259-022-06053-8</mixed-citation></ref><ref id="scirp.128204-ref29"><label>29</label><mixed-citation publication-type="other" xlink:type="simple">Sun, H., Plawinski, J., Subramaniam, S., Jamaludin, A., Kadir, T., Readie, A. and Coroller, T. (2023) A Deep Learning Approach to Private Data Sharing of Medical Images Using Conditional Generative Adversarial Networks (GANs). PLOS ONE, 18, e0280316. https://doi.org/10.1371/journal.pone.0280316</mixed-citation></ref></ref-list></back></article>