9.9 KiB
Code is clean if it can be understood easily – by everyone on the team. Clean code can be read and enhanced by a developer other than its original author. With understandability comes readability, changeability, extensibility and maintainability.
General rules
- Follow standard conventions.
- Keep it simple stupid. Simpler is always better. Reduce complexity as much as possible.
- Boy scout rule. Leave the campground cleaner than you found it.
- Always find root cause. Always look for the root cause of a problem.
Design rules
- Keep configurable data at high levels.
- Prefer polymorphism to if/else or switch/case.
- Separate multi-threading code.
- Prevent over-configurability.
- Use dependency injection.
- Follow Law of Demeter. A class should know only its direct dependencies.
Understandability tips
- Be consistent. If you do something a certain way, do all similar things in the same way.
- Use explanatory variables.
- Encapsulate boundary conditions. Boundary conditions are hard to keep track of. Put the processing for them in one place.
- Prefer dedicated value objects to primitive type.
- Avoid logical dependency. Don't write methods which works correctly depending on something else in the same class.
- Avoid negative conditionals.
Names rules
- Choose descriptive and unambiguous names.
- Make meaningful distinction.
- Use pronounceable names.
- Use searchable names.
- Replace magic numbers with named constants.
- Avoid encodings. Don't append prefixes or type information.
Functions rules
- Small.
- Do one thing.
- Use descriptive names.
- Prefer fewer arguments.
- Have no side effects.
- Don't use flag arguments. Split method into several independent methods that can be called from the client without the flag.
Comments rules
- Always try to explain yourself in code.
- Don't be redundant.
- Don't add obvious noise.
- Don't use closing brace comments.
- Don't comment out code. Just remove.
- Use as explanation of intent.
- Use as clarification of code.
- Use as warning of consequences.
Source code structure
- Separate concepts vertically.
- Related code should appear vertically dense.
- Declare variables close to their usage.
- Dependent functions should be close.
- Similar functions should be close.
- Place functions in the downward direction.
- Keep lines short.
- Don't use horizontal alignment.
- Use white space to associate related things and disassociate weakly related.
- Don't break indentation.
Objects and data structures
- Hide internal structure.
- Prefer data structures.
- Avoid hybrids structures (half object and half data).
- Should be small.
- Do one thing.
- Small number of instance variables.
- Base class should know nothing about their derivatives.
- Better to have many functions than to pass some code into a function to select a behavior.
- Prefer non-static methods to static methods.
Tests
- One assert per test.
- Readable.
- Fast.
- Independent.
- Repeatable.
Code smells
- Rigidity. The software is difficult to change. A small change causes a cascade of subsequent changes.
- Fragility. The software breaks in many places due to a single change.
- Immobility. You cannot reuse parts of the code in other projects because of involved risks and high effort.
- Needless Complexity.
- Needless Repetition.
- Opacity. The code is hard to understand.
Clean Code
Here are some general principles for writing highly usable, functional, reliable, fast, and scalable code:
-
Clear and Understandable: The code should be written in a way that's easy for others to understand. This includes using clear variable and function names, and including comments to explain complex sections of code.
-
Modular and Reusable: Code should be broken down into small, modular functions and classes that each perform a single task. This makes the code more understandable, and also allows for code reuse.
-
Robust Error Handling: The code should be able to handle all potential errors gracefully, and should never crash unexpectedly. This includes checking for invalid input, catching exceptions, and providing useful error messages.
-
Type Handling: Whenever possible, the code should enforce and check types to prevent type-related errors. This can be done through the use of type hints in languages like Python, or through explicit type checks.
-
Logging: The code should include extensive logging to make it easier to debug and understand what the code is doing. This includes logging any errors that occur, as well as important events or state changes.
-
Performance: The code should be optimized for performance, avoiding unnecessary computation and using efficient algorithms and data structures. This includes profiling the code to identify and optimize performance bottlenecks.
-
Scalability: The code should be designed to scale well as the size of the input data or the number of users increases. This includes using scalable algorithms and data structures, and designing the code to work well in a distributed or parallel computing environment if necessary.
-
Testing: The code should include comprehensive tests to ensure that it works correctly. This includes unit tests for individual functions and classes, as well as integration tests to ensure that the different parts of the code work well together.
-
Version Control: The code should be stored in a version control system like Git, which allows for tracking changes, collaborating with others, and rolling back to a previous state if necessary.
-
Documentation: The codebase should be well-documented, both in terms of comments within the code and external documentation that explains how to use and contribute to the code.
-
Continuous Integration/Continuous Deployment (CI/CD): Implement CI/CD pipelines for automatic testing and deployment. This ensures that any new changes do not break existing functionality and that the latest version of the application is always available for deployment.
Examples
-
Clear and Understandable: Use meaningful variable and function names. Include comments when necessary.
# Good example def calculate_average(numbers: List[int]) -> float: """Calculate and return the average of a list of numbers.""" total = sum(numbers) count = len(numbers) return total / count
For file and folder names, use descriptive names that relate to their function in your program. For example, a file that contains functions for handling user input might be named
user_input.py
. -
Modular and Reusable: Write functions for tasks that you perform over and over.
def greet_user(name: str): """Print a greeting to the user.""" print(f"Hello, {name}!")
For folder structure, group related files in the same directory. For example, all test files could be in a
tests
directory. -
Robust Error Handling: Use try/except blocks to catch and handle errors.
def divide_numbers(numerator: float, denominator: float) -> float: """Divide two numbers and handle division by zero.""" try: return numerator / denominator except ZeroDivisionError: print("Error: Division by zero.") return None
-
Type Handling: Use type hints to specify the type of function arguments and return values.
def greet_user(name: str) -> None: """Greet the user.""" print(f"Hello, {name}!")
-
Logging: Use the
logging
module to log events.import logging logging.basicConfig(level=logging.INFO) def divide_numbers(numerator: float, denominator: float) -> float: """Divide two numbers and log if division by zero occurs.""" try: return numerator / denominator except ZeroDivisionError: logging.error("Attempted division by zero.") return None
-
Performance: Use built-in functions and data types for better performance.
# Using a set to check for membership is faster than using a list numbers_set = set(numbers) if target in numbers_set: print(f"{target} is in the set of numbers.")
-
Scalability: For scalability, an example might involve using a load balancer or dividing tasks among different workers or threads. This is more of a system design consideration than a single piece of code.
-
Testing: Write tests for your functions.
def test_calculate_average(): assert calculate_average([1, 2, 3, 4]) == 2.5
For tests, you could have a separate
tests
directory. Inside this directory, each test file could be namedtest_<filename>.py
where<filename>
is the name of the file being tested. -
Version Control: This point refers to using tools like Git for version control. A simple example would be committing changes to a repository:
git add . git commit -m "Add function to calculate average" git push
-
Documentation: Write docstrings for your functions.
def calculate_average(numbers: List[int]) -> float: """Calculate and return the average of a list of numbers.""" ...
Documentation might be kept in a
docs
directory, with separate files for different topics. -
Continuous Integration/Continuous Deployment (CI/CD): This is typically handled by a system like Jenkins, GitHub Actions, or GitLab CI/CD. It involves creating a script or configuration file that tells the CI/CD system how to build, test, and deploy your code. For example, a
.github/workflows/main.yml
file for a GitHub Actions workflow.
Remember, consistency in your naming conventions and organization is key. Having a standard and sticking to it will make your codebase easier to navigate and understand.