Traditional database of blockchain
blockchain is a new application mode of distributed data storage, point-to-point transmission, consensus mechanism, encryption algorithm and other computer technologies. Blockchain is an important concept of bitcoin,
in essence, it is a decentralized database. At the same time, as the underlying technology of bitcoin, it is a series of data blocks generated by using cryptographic methods. Each data block contains a batch of information of bitcoin network transactions, Used to verify the validity of its information (anti-counterfeiting) and generate the next block
extended data
most blockchain public chains are limited by scalability. The biggest feature of blockchain technology is decentralization, which requires that all accounts in the network need to deal with the accounting process. Distributed accounting has high security, low misoperation rate, political neutrality and correctness
however, blockchain technology embraces these features at the same time, sacrificing scalability, unable to meet the personalized supervision, and slightly insufficient in protecting data privacy. Moreover, with the increase of the number of ledger, the interaction delay will increase exponentially, that is to say, the more ledger in the blockchain network, the higher the delay
in the near future, the number of devices in the Internet of things will explode, which may be hundreds of billions or trillions. If such a huge network is still managed in a centralized networking mode, the infrastructure investment and maintenance of the data center should be incalculable
in essence, big data belongs to a small branch of database, so this problem is attributed to the relationship between big data and database. Database is a very old research field in software, Internet and it. From the initial file system to ER model, to the three achievements of traditional database, which are known to all, relational model, transaction processing and query optimization, to the rise of NoSQL database after the popularity of Internet, database technology is constantly developing and changing, Then it also includes semi-structured data processing represented by XML and unstructured data processing such as text and voice
it seems that the relationship between blockchain and database is actually such a relationship. From the evolution process of database technology, we can find that it always comes from how to meet new business requirements and then create new data processing technologies. For example, from the beginning of the file system, why do we need the ER model? It is because with the development of the financial instry, people have further requirements for these fast accounting, high concurrency data writing and access, which leads to the emergence and rapid development of the entity relationship model. Why did NoSQL database appear later? It is because the rapid development of the Internet puts forward higher requirements for updating the database, so in essence we think that the whole Internet is a big database
things are always developing. Of course, after we solve the problem of massive real-time data processing on the Internet through NoSQL database and cloud storage, the next problem must come, that is, how to solve the authenticity and effectiveness of data in a large-scale way
for example, it may be related to our diet, from the problem of food and clothing at the beginning, to the problem of nutrition structure, and then to the problem of food safety that everyone is concerned about. The development of database is the same. When we can solve these problems of data storage and data access through ER entity relationship model and NoSQL database, Next, we need to care about and solve the problem of authenticity and effectiveness
so at this stage, these technologies, represented by blockchain, must be a new starting point and new requirements for the authenticity, validity, unforgeability and tamper resistance of data compared with the current database. We can clearly feel that the trend of database and blockchain integration is actually very close and irresistible, just like the movie just mentioned, the procers of content begin to develop in the direction of virtual reality and augmented reality; From the perspective of database, blockchain is a new way of data organization. We believe that big data and blockchain are integrated.
first, data security: blockchain makes data flow "at ease"
with its trustworthiness, security and non tamperability, blockchain liberates more data. A typical case is used to illustrate how blockchain promotes the generation of big data for gene sequencing. Blockchain sequencing can use the private key to limit access rights, so as to avoid the legal restrictions on indivial access to gene data, and use distributed computing resources to complete the sequencing service at low cost. The security of blockchain makes sequencing an instrialized solution, realizing global scale sequencing, thus promoting the massive growth of data< Second, data open sharing: blockchain guarantees data privacy
the government has a large number of high-density and high-value data, such as medical data, population data, etc. The opening of government data is the general trend, which will have an inestimable impetus to the development of the whole economy and society. However, the main difficulty and challenge of data opening is how to open data while protecting personal privacy. Data desensitization technology based on blockchain can ensure data privacy and provide a solution for data opening under privacy protection. Data desensitization technology mainly uses hash processing and other encryption algorithms. For example, enigma based on blockchain technology can protect the privacy of data and eliminate information security problems in data sharing by computing data without accessing the original data. For example, employees of a company can safely open the path to access their salary information and jointly calculate the average salary within the group. Each participant can know his or her relative position in the group, but he or she knows nothing about the salaries of other members< The schematic diagram of data hash desensitization processing
3. Data storage: blockchain is a kind of database storage technology that can not be tampered with, full history, strong endorsement
blockchain technology, through all nodes in the network participate in the calculation together, and verify the authenticity of their information to achieve the consensus of the whole network, it can be said that blockchain technology is a specific database technology. So far, our big data is still in a very basic stage. The data based on the consensus of the whole network and the reliable blockchain data are tamperable and historical, which makes the quality of the data gain unprecedented strong trust endorsement, and also makes the development of the database enter a new era< Data analysis: blockchain ensures data security
data analysis is the core of realizing data value. In data analysis, how to effectively protect personal privacy and prevent core data leakage has become the primary consideration. For example, with the application of fingerprint data analysis and the popularization of genetic data detection and analysis, more and more people worry that once personal health data is leaked, it may lead to serious consequences. Blockchain technology can prevent this kind of situation through multi signature private key, encryption technology and secure multi-party computing technology. When the data is hashed and placed on the blockchain, the use of digital signature technology can enable those authorized people to access the data. The private key not only ensures the privacy of data, but also can be shared with authorized research institutions. The data is stored on the decentralized blockchain and analyzed without accessing the original data. It can not only protect the privacy of the data, but also provide it to the global scientific research institutions and doctors for sharing. As the basic health database of all mankind, it will bring great convenience to solve the sudden and difficult diseases in the future
v. data circulation: blockchain protects the rights and interests related to data
for valuable data assets of indivials or institutions, blockchain can be used to register them. Transaction records are recognized, transparent and traceable throughout the network, making clear the source, ownership, use right and circulation path of big data assets, which is of great value to data asset transaction
on the one hand, blockchain can break the threat of intermediary ing data, which is concive to the establishment of a trusted data asset trading environment. Data is a very special commodity, which is essentially different from ordinary commodities. It is mainly characterized by unclear ownership and "being owned after seeing and ing", which also determines that the traditional commodity intermediary transaction method can not meet the needs of data sharing, exchange and transaction. Because the intermediary center has the conditions and the ability to and save all the flowing data, it is extremely unfair to the data procers. This threat can not be eliminated only by commitment, and the existence of this threat has become a huge obstacle to the flow of data. Based on the decentralized blockchain, it can break the threat of ing data in the intermediary center and protect the legitimate rights and interests of data owners
on the other hand, blockchain provides a traceable path, which can effectively solve the problem of data right confirmation. Through multiple nodes participating in the calculation in the network, blockchain can participate in the calculation and recording of data together, and verify the effectiveness of their information. It can not only carry out information anti-counterfeiting, but also provide a traceable path. The transaction information of each block is strung together to form a complete detailed list of transactions. The context of each transaction is very clear and transparent. In addition, when people have doubts about the "value" of a block, they can easily trace back the historical transaction records, and then judge whether the value is correct, and identify whether the value has been tampered or recorded incorrectly
with the protection of blockchain, big data will naturally become more active
the tokens of crowdfunding projects on coin Ying China platform are all developed based on blockchain technology, and relevant information will be recorded on the blockchain.
at present, the traditional database is centralized recording and storage. Even for remote disaster recovery and cloud storage, it only changes the storage location from one place to multiple places, from local to cloud. If there is a problem with the central server, the disaster recovery database will stop updating data; The blockchain database is distributed record, distributed storage and distributed propagation. Each information is transmitted by a single node to all other nodes in the whole network. Each node is responsible for data recording and storage. No centralized or third-party organization is responsible for management. If one node has problems, other nodes will continue to update and store data, Through the way of decentralization, the stable operation of the system is maintained, and the information is complete and reliable.
distinguish blockchain from ordinary database;, The main point is that the specific rules for putting data into the database are different. In other words, it can not conflict with other data already in the database. It is only added, immutable, and the data itself is locked to an owner (which can be owned). It is replicable and available. Finally, everyone agrees that the state of data in a decentralized database is compliant.