How bad blocks are generated, and what means does SSD u...

Return to site

How bad blocks are generated, and what means does SSD use to seek out and manage undesirable blocks, what challenges exist in the terrible block management strategy advisable by the manufacturer, what kind of management methods will be better, and formatting the challenging disk will result in wrong The bad block table is lost, and what safety hazards will arise following the SSD is repaired, this article will elaborate one particular by a single.

Overview

The negative block management style notion is related to SSD reliability and efficiency. Some NandFlash vendors' poor block management practices may well not be really affordable. When solution style, if some abnormal conditions usually are not regarded as sufficient, it will often result in some unexpected terrible blocks.

For instance, after testing quite a few different most important manage SSDs, Bingge discovered that the issue of newly added bad blocks because of abnormal energy failure is very widespread. Searching for 'abnormal power failure produces terrible blocks' or equivalent keywords and phrases having a search engine The issue isn't only inside the testing course of action, you will discover also many complications that basically occur for the end user.

Who will handle the negative blocks

For the master without having a specific flash file method, the poor blocks is usually managed by the firmware with the SSD controller. For the specific flash file method, the undesirable blocks is often managed by the specific flash file method or Driver.

Negative blocks (BadBlock) are divided into three forms:

1. Ex-factory poor blocks, or initial negative blocks, that is definitely, blocks that do not meet the manufacturer's requirements or fail to meet the manufacturer's published standards at the time of shipment, have already been marked as poor blocks by the manufacturer in the factory; Some can't be Erase;

two. New undesirable blocks or bad blocks attributable to put on in the course of use;

three. Fake terrible blocks which can be misjudged by the key control due to abnormal power failure, etc .;

Not all of the newly added poor blocks are caused by put on. In the event the SSD doesn't have an abnormal power-off protection function, the abnormal power-off could result in the key handle to misjudge the undesirable blocks or make new ones. Devoid of abnormal power-off protection, when the Lowerpage has been effectively programmed, in addition to a sudden power failure during the Upperpage programming approach, it can inevitably result in data transmission errors inside the Lowerpage. If the number of data errors exceeds the SSDECC error correction capability, then it will likely be An error happens in the course of reading, plus the block will be judged as 'BadBlock' by the master and marked within the badblocktable.

Many of the newly added terrible blocks may be Erase, and just after the newly added bad blocks are erased, re-reading, reading and erasing the information could not cause errors again, simply because the error can also be connected for the pattern from the written information, use a certain pattern If a thing goes wrong, it may not be wrong to modify an additional pattern.

The ratio of factory negative blocks inside the complete Device

I've consulted several original NandFlash makers and gave a additional general statement: the ratio of negative blocks in the factory doesn't exceed 2%, and the manufacturer will leave a component with the margin to ensure that even when the maximum quantity of P / E promised by the manufacturer is reached, There is certainly nevertheless a poor block price of no additional than 2%. It seems that it truly is not a simple activity to guarantee 2%. The poor block price when Bingge got a brand new sample exceeded 2%, the actual test was two.55%

Strategy for determining negative blocks

1. Judgment strategy with the factory undesirable blocks

The scanning of undesirable blocks basically scans irrespective of whether the byte corresponding towards the address specified by the manufacturer has the FFh flag, and if there is no FFh, it's a poor block.

The place in the negative block identification is roughly the exact same for each manufacturer. For SLC and MLC, the place is various. Take Micron as an instance:

1.1 For the SLC of smaller pages (528Byte), does the sixth Byte within the sparearea of ​​the initial web page of every block have the FFh flag, if not, it really is a terrible block;

1.2 For SLCs with huge pages (greater than or equal to 2112 Bytes), do the initial and sixth Bytes on the Sparearea of ​​the first page of each Block have the FFh flag, if not, it is a undesirable block;

1.three For MLC, the factory poor blocks are scanned by scanning the initial web page and also the final web page in the 1st and second Bytes of every block to determine in the event the first or second Byte may be the 0xFF flag, which can be 0xFF, which can be so quick, there's no 0xFF It's a negative block.

To borrow a image from Hynixdatasheet to illustrate:

What data is inside the undesirable block? All 0s or all 1s? The results observed by Bingge's test are as follows. Of course, this might not be the truth. The factory undesirable blocks may be true, nevertheless it is not necessary to add new poor blocks, otherwise it truly is not impossible to hide data by way of 'bad blocks'

Can the factory terrible blocks be erased

Some are 'can' erased, and some are prohibited by the manufacturer. The so-called 'can' erase only indicates that the undesirable block identification is usually changed by sending an erase command, rather than suggesting that terrible blocks can be applied.

The manufacturer strongly recommends to not erase the negative block. After the poor block flag is erased, it can't be 'recovered'. Writing information around the bad block is risky.

two. In the procedure of working with, the judgment approach of newly added poor blocks

The newly added terrible block should be to judge no matter if the operation of NandFlash is effective by way of the feedback outcome with the status register. When the Plan or Erase, when the status register feedback is fail, the SSD main handle will list the block as a negative block.

Particularly:

two.1. Error when executing erase command;
2.2. Error when executing write command;
two.3. An error happens when the read command is executed; when the study command is executed, in the event the quantity of bit errors exceeds the error correction capability with the ECC, the block will likely be judged as a undesirable block.

Terrible block management technique

Negative blocks are managed by generating and updating the negative block table (BadBlockTable: BBT). There's no uniform specification and practice for the poor block table. Some engineers use a table to handle the factory poor blocks and newly added negative blocks, some engineers will handle the two tables separately, and a few engineers will treat the initial terrible blocks as separate Table, factory terrible blocks plus new undesirable blocks as yet another table.

For the content material of your undesirable block table, the expression isn't consistent, and some will probably be expressed additional roughly, for example: use 0 to indicate quick, use 1 to indicate undesirable blocks or vice versa. Some engineers will use a extra detailed description, such as: 00 for bad blocks in the factory, 01 for undesirable blocks when Program fails, 10 for undesirable blocks when Read fails, and 11 for poor blocks when Erase fails.

The poor block table is generally saved within a separate region (eg Block0, page0 and Block1, page1). It is actually more effective to read BBT directly immediately after each power-on. Considering that NandFlash itself will also be broken, it may result in the loss of BBT As a result, BBT is normally used for backup processing. The amount of backups is various for each home. A number of people back up two and other individuals back up. Normally, how to repair unreadable flash drive is possible to make use of the probability theory voting program to calculate, regardless of what, no less than Additional than two copies.

Undesirable block management methods typically contain: terrible block skip method and undesirable block replacement technique;

Negative block skip strategy

1. For the initial poor block, the poor block skip will skip the corresponding poor block through BBT and directly retailer the data within the next good block.

2. For the newly added undesirable block, update the terrible block to BBT, transfer the valid information within the terrible block to the next superior block, and skip directly when undertaking the corresponding Read, Plan or Erse in the future This negative block.

Bad block replacement strategy (recommended by a NandFlash vendor)

Bad block replacement refers to replacing terrible blocks generated during use with superior blocks inside the reserved location. Suppose that through the system, the nth web page has an error, then beneath the terrible block replacement approach, the information in page0 to page (n-1) is going to be copied to the same position from the totally free Block (eg BlockD) in the reserved region, Then write the data of the nth web page within the data register towards the pagen in BlockD.

The manufacturer's recommended method is always to divide the entire data location into two components. One part may be the user-visible location, that is used for typical information operations by the user, as well as the other component is really a spare area specially ready for replacing the poor block, which can be made use of to retailer the data for replacing the undesirable block and Save the bad block table, the proportion of your spare location is 2% in the entire capacity.

When a undesirable block is generated, FTL will remap the BadBlock address for the excellent block address inside the reserved area, rather than directly skipping the negative block to the next superior block. Before each write operation towards the logical address, which physical address will likely be calculated very first You'll be able to create which addresses are negative blocks, and if it is a undesirable block, create the data for the address on the corresponding reserved region.

Brother Bing did not see any suggestion about whether 2% on the reserved location ought to be included within the OP location or an more location, nor did he see a description of irrespective of whether the 2% of your reserved location was dynamic or static, and the joining was an independent region And it truly is a static location, then this approach will have the following disadvantages:

1. Straight reserve 2% with the region for undesirable block replacement, that will lessen the offered capacity and waste space. At the very same time, due to the modest number of obtainable blocks, the average number of accessible terrible put on is accelerated; two. Assuming that the available area has more poor blocks At 2%, it suggests that all the reserved places are replaced, and also the bad blocks generated will not be processed, along with the SSD will face the end of life.

Terrible block replacement tactic (the practice of some SSD suppliers)

The truth is, within the true item design, it really is hardly ever seen that a 2% ratio is reserved as a terrible block replacement area. In general, the OP (OverProvison) area freeblock will probably be utilized to replace the new addition during the use course of action. For negative blocks, take garbage collection as an example. When the garbage collection mechanism is operating, first move the valid page data in the Block that should be recovered for the freeBlock, then carry out Erase operation on this Block. Assume that the Erase status register reports that Erase failed. The poor block management mechanism will update this Block address for the new poor block list, in the same time, create the valid information pages inside the bad block towards the FreeBlock within the OP area, update the poor block management table, the next time you create data , Straight skip the negative block towards the subsequent out there block.

Distinct manufacturers have diverse OP sizes, distinct application scenarios, diverse reliability needs, and different OP sizes. There is a trade-off partnership between OP and stability. The bigger the OP, the a lot more garbage is written inside the method of continuous writing. The larger the reclaimed absolutely free space, the additional steady the overall performance plus the smoother the overall performance curve. Conversely, the smaller sized the OP, the worse the performance stability. Needless to say, the larger the user's offered space, the larger the offered space indicates the a lot more expense low.

Generally speaking, OP may be set to 5% -50%, 7% of OP is often a popular ratio, in contrast to the 2% fixed block suggested by the manufacturer, 7% isn't a fixed block to perform OP, Alternatively, it truly is dynamically distributed in all Blocks, which can be more conducive to wear-leveling approaches.

The troubles of SSD repair

For many SSD makers who do not have the master manage technology, when the product is repaired, the usual practice is to replace the faulty device and restart the mass production operation. At this time, the new poor block list are going to be lost, and the new poor block list will probably be lost. This indicates that you will find already bad blocks within the NandFlash that have not been replaced. The operating program or sensitive information may well be written to the terrible block region, which may well bring about the user's operating system to crash. Even to get best usb flash drive recovery software with a master handle, whether it's going to save a list of existing negative blocks for the user depends upon the attitude in the user facing the manufacturer.

Whether terrible block production will affect the study and create speed and stability of SSD

Factory undesirable blocks might be separated around the bitline, so it can not have an effect on the erase and create speed of other blocks. Even so, if you can find adequate new undesirable blocks within the whole SSD, the out there blocks of the entire disk will likely be decreased, that will result in an increase within the quantity of garbage collections. The reduction in OP capacity will seriously affect the efficiency of garbage collection. Thus, increasing the amount of undesirable blocks to a particular level will impact the performance stability on the SSD, especially when the SSD is constantly written. Due to the fact the system performs garbage collection, it can lead to If the overall performance drops, the SSD functionality curve will fluctuate tremendously.

Bing brother individual WeChat, welcome to exchange:

Overview

The negative block management style notion is related to SSD reliability and efficiency. Some NandFlash vendors' poor block management practices may well not be really affordable. When solution style, if some abnormal conditions usually are not regarded as sufficient, it will often result in some unexpected terrible blocks.

For instance, after testing quite a few different most important manage SSDs, Bingge discovered that the issue of newly added bad blocks because of abnormal energy failure is very widespread. Searching for 'abnormal power failure produces terrible blocks' or equivalent keywords and phrases having a search engine The issue isn't only inside the testing course of action, you will discover also many complications that basically occur for the end user.

Who will handle the negative blocks

For the master without having a specific flash file method, the poor blocks is usually managed by the firmware with the SSD controller. For the specific flash file method, the undesirable blocks is often managed by the specific flash file method or Driver.

Negative blocks (BadBlock) are divided into three forms:

1. Ex-factory poor blocks, or initial negative blocks, that is definitely, blocks that do not meet the manufacturer's requirements or fail to meet the manufacturer's published standards at the time of shipment, have already been marked as poor blocks by the manufacturer in the factory; Some can't be Erase;

two. New undesirable blocks or bad blocks attributable to put on in the course of use;

three. Fake terrible blocks which can be misjudged by the key control due to abnormal power failure, etc .;

Not all of the newly added poor blocks are caused by put on. In the event the SSD doesn't have an abnormal power-off protection function, the abnormal power-off could result in the key handle to misjudge the undesirable blocks or make new ones. Devoid of abnormal power-off protection, when the Lowerpage has been effectively programmed, in addition to a sudden power failure during the Upperpage programming approach, it can inevitably result in data transmission errors inside the Lowerpage. If the number of data errors exceeds the SSDECC error correction capability, then it will likely be An error happens in the course of reading, plus the block will be judged as 'BadBlock' by the master and marked within the badblocktable.

Many of the newly added terrible blocks may be Erase, and just after the newly added bad blocks are erased, re-reading, reading and erasing the information could not cause errors again, simply because the error can also be connected for the pattern from the written information, use a certain pattern If a thing goes wrong, it may not be wrong to modify an additional pattern.

The ratio of factory negative blocks inside the complete Device

I've consulted several original NandFlash makers and gave a additional general statement: the ratio of negative blocks in the factory doesn't exceed 2%, and the manufacturer will leave a component with the margin to ensure that even when the maximum quantity of P / E promised by the manufacturer is reached, There is certainly nevertheless a poor block price of no additional than 2%. It seems that it truly is not a simple activity to guarantee 2%. The poor block price when Bingge got a brand new sample exceeded 2%, the actual test was two.55%

Strategy for determining negative blocks

1. Judgment strategy with the factory undesirable blocks

The scanning of undesirable blocks basically scans irrespective of whether the byte corresponding towards the address specified by the manufacturer has the FFh flag, and if there is no FFh, it's a poor block.

The place in the negative block identification is roughly the exact same for each manufacturer. For SLC and MLC, the place is various. Take Micron as an instance:

1.1 For the SLC of smaller pages (528Byte), does the sixth Byte within the sparearea of ​​the initial web page of every block have the FFh flag, if not, it really is a terrible block;

1.2 For SLCs with huge pages (greater than or equal to 2112 Bytes), do the initial and sixth Bytes on the Sparearea of ​​the first page of each Block have the FFh flag, if not, it is a undesirable block;

1.three For MLC, the factory poor blocks are scanned by scanning the initial web page and also the final web page in the 1st and second Bytes of every block to determine in the event the first or second Byte may be the 0xFF flag, which can be 0xFF, which can be so quick, there's no 0xFF It's a negative block.

To borrow a image from Hynixdatasheet to illustrate:

What data is inside the undesirable block? All 0s or all 1s? The results observed by Bingge's test are as follows. Of course, this might not be the truth. The factory undesirable blocks may be true, nevertheless it is not necessary to add new poor blocks, otherwise it truly is not impossible to hide data by way of 'bad blocks'

Can the factory terrible blocks be erased

Some are 'can' erased, and some are prohibited by the manufacturer. The so-called 'can' erase only indicates that the undesirable block identification is usually changed by sending an erase command, rather than suggesting that terrible blocks can be applied.

The manufacturer strongly recommends to not erase the negative block. After the poor block flag is erased, it can't be 'recovered'. Writing information around the bad block is risky.

two. In the procedure of working with, the judgment approach of newly added poor blocks

The newly added terrible block should be to judge no matter if the operation of NandFlash is effective by way of the feedback outcome with the status register. When the Plan or Erase, when the status register feedback is fail, the SSD main handle will list the block as a negative block.

Particularly:

two.1. Error when executing erase command;
2.2. Error when executing write command;
two.3. An error happens when the read command is executed; when the study command is executed, in the event the quantity of bit errors exceeds the error correction capability with the ECC, the block will likely be judged as a undesirable block.

Terrible block management technique

Negative blocks are managed by generating and updating the negative block table (BadBlockTable: BBT). There's no uniform specification and practice for the poor block table. Some engineers use a table to handle the factory poor blocks and newly added negative blocks, some engineers will handle the two tables separately, and a few engineers will treat the initial terrible blocks as separate Table, factory terrible blocks plus new undesirable blocks as yet another table.

For the content material of your undesirable block table, the expression isn't consistent, and some will probably be expressed additional roughly, for example: use 0 to indicate quick, use 1 to indicate undesirable blocks or vice versa. Some engineers will use a extra detailed description, such as: 00 for bad blocks in the factory, 01 for undesirable blocks when Program fails, 10 for undesirable blocks when Read fails, and 11 for poor blocks when Erase fails.

The poor block table is generally saved within a separate region (eg Block0, page0 and Block1, page1). It is actually more effective to read BBT directly immediately after each power-on. Considering that NandFlash itself will also be broken, it may result in the loss of BBT As a result, BBT is normally used for backup processing. The amount of backups is various for each home. A number of people back up two and other individuals back up. Normally, how to repair unreadable flash drive is possible to make use of the probability theory voting program to calculate, regardless of what, no less than Additional than two copies.

Undesirable block management methods typically contain: terrible block skip method and undesirable block replacement technique;

Negative block skip strategy

1. For the initial poor block, the poor block skip will skip the corresponding poor block through BBT and directly retailer the data within the next good block.

2. For the newly added undesirable block, update the terrible block to BBT, transfer the valid information within the terrible block to the next superior block, and skip directly when undertaking the corresponding Read, Plan or Erse in the future This negative block.

Bad block replacement strategy (recommended by a NandFlash vendor)

Bad block replacement refers to replacing terrible blocks generated during use with superior blocks inside the reserved location. Suppose that through the system, the nth web page has an error, then beneath the terrible block replacement approach, the information in page0 to page (n-1) is going to be copied to the same position from the totally free Block (eg BlockD) in the reserved region, Then write the data of the nth web page within the data register towards the pagen in BlockD.

The manufacturer's recommended method is always to divide the entire data location into two components. One part may be the user-visible location, that is used for typical information operations by the user, as well as the other component is really a spare area specially ready for replacing the poor block, which can be made use of to retailer the data for replacing the undesirable block and Save the bad block table, the proportion of your spare location is 2% in the entire capacity.

When a undesirable block is generated, FTL will remap the BadBlock address for the excellent block address inside the reserved area, rather than directly skipping the negative block to the next superior block. Before each write operation towards the logical address, which physical address will likely be calculated very first You'll be able to create which addresses are negative blocks, and if it is a undesirable block, create the data for the address on the corresponding reserved region.

Brother Bing did not see any suggestion about whether 2% on the reserved location ought to be included within the OP location or an more location, nor did he see a description of irrespective of whether the 2% of your reserved location was dynamic or static, and the joining was an independent region And it truly is a static location, then this approach will have the following disadvantages:

1. Straight reserve 2% with the region for undesirable block replacement, that will lessen the offered capacity and waste space. At the very same time, due to the modest number of obtainable blocks, the average number of accessible terrible put on is accelerated; two. Assuming that the available area has more poor blocks At 2%, it suggests that all the reserved places are replaced, and also the bad blocks generated will not be processed, along with the SSD will face the end of life.

Terrible block replacement tactic (the practice of some SSD suppliers)

The truth is, within the true item design, it really is hardly ever seen that a 2% ratio is reserved as a terrible block replacement area. In general, the OP (OverProvison) area freeblock will probably be utilized to replace the new addition during the use course of action. For negative blocks, take garbage collection as an example. When the garbage collection mechanism is operating, first move the valid page data in the Block that should be recovered for the freeBlock, then carry out Erase operation on this Block. Assume that the Erase status register reports that Erase failed. The poor block management mechanism will update this Block address for the new poor block list, in the same time, create the valid information pages inside the bad block towards the FreeBlock within the OP area, update the poor block management table, the next time you create data , Straight skip the negative block towards the subsequent out there block.