ultralytics 8.0.196 instance-mean Segment loss (#5285)

Co-authored-by: Andy <39454881+yermandy@users.noreply.github.com>
2023-10-09 20:08:39 +02:00 · 2023-10-09 20:08:39 +02:00 · e7f0658744
commit e7f0658744
parent 7517667a33
72 changed files with 369 additions and 493 deletions
--- a/docs/modes/export.md
+++ b/docs/modes/export.md
@ -14,9 +14,9 @@ The ultimate goal of training a model is to deploy it for real-world application

 <p align="center">
  <br>
-  <iframe width="720" height="405" src="https://www.youtube.com/embed/WbomGeoOT_k?si=aGmuyooWftA0ue9X" 
-    title="YouTube video player" frameborder="0" 
-    allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" 
+  <iframe width="720" height="405" src="https://www.youtube.com/embed/WbomGeoOT_k?si=aGmuyooWftA0ue9X"
+    title="YouTube video player" frameborder="0"
+    allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share"
    allowfullscreen>
  </iframe>
  <br>
--- a/docs/modes/predict.md
+++ b/docs/modes/predict.md
@ -14,9 +14,9 @@ In the world of machine learning and computer vision, the process of making sens

 <p align="center">
  <br>
-  <iframe width="720" height="405" src="https://www.youtube.com/embed/QtsI0TnwDZs?si=ljesw75cMO2Eas14" 
-    title="YouTube video player" frameborder="0" 
-    allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" 
+  <iframe width="720" height="405" src="https://www.youtube.com/embed/QtsI0TnwDZs?si=ljesw75cMO2Eas14"
+    title="YouTube video player" frameborder="0"
+    allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share"
    allowfullscreen>
  </iframe>
  <br>
@ -415,10 +415,10 @@ All Ultralytics `predict()` calls will return a list of `Results` objects:

    ```python
    from ultralytics import YOLO
-    
+
    # Load a pretrained YOLOv8n model
    model = YOLO('yolov8n.pt')
-    
+
    # Run inference on an image
    results = model('bus.jpg')  # list of 1 Results object
    results = model(['bus.jpg', 'zidane.jpg'])  # list of 2 Results objects
@ -467,13 +467,13 @@ For more details see the `Results` class [documentation](../reference/engine/res

    ```python
    from ultralytics import YOLO
-    
+
    # Load a pretrained YOLOv8n model
    model = YOLO('yolov8n.pt')
-    
+
    # Run inference on an image
    results = model('bus.jpg')  # results list
-    
+
    # View results
    for r in results:
        print(r.boxes)  # print the Boxes object containing the detection bounding boxes
@ -505,13 +505,13 @@ For more details see the `Boxes` class [documentation](../reference/engine/resul

    ```python
    from ultralytics import YOLO
-    
+
    # Load a pretrained YOLOv8n-seg Segment model
    model = YOLO('yolov8n-seg.pt')
-    
+
    # Run inference on an image
    results = model('bus.jpg')  # results list
-    
+
    # View results
    for r in results:
        print(r.masks)  # print the Masks object containing the detected instance masks
@ -538,13 +538,13 @@ For more details see the `Masks` class [documentation](../reference/engine/resul

    ```python
    from ultralytics import YOLO
-    
+
    # Load a pretrained YOLOv8n-pose Pose model
    model = YOLO('yolov8n-pose.pt')
-    
+
    # Run inference on an image
    results = model('bus.jpg')  # results list
-    
+
    # View results
    for r in results:
        print(r.keypoints)  # print the Keypoints object containing the detected keypoints
@ -572,13 +572,13 @@ For more details see the `Keypoints` class [documentation](../reference/engine/r

    ```python
    from ultralytics import YOLO
-    
+
    # Load a pretrained YOLOv8n-cls Classify model
    model = YOLO('yolov8n-cls.pt')
-    
+
    # Run inference on an image
    results = model('bus.jpg')  # results list
-    
+
    # View results
    for r in results:
        print(r.probs)  # print the Probs object containing the detected class probabilities
@ -622,9 +622,9 @@ You can use the `plot()` method of a `Result` objects to visualize predictions.
        im.show()  # show image
        im.save('results.jpg')  # save image
    ```
-    
+
    The `plot()` method supports the following arguments:
-    
+
    | Argument     | Type            | Description                                                                    | Default       |
    |--------------|-----------------|--------------------------------------------------------------------------------|---------------|
    | `conf`       | `bool`          | Whether to plot the detection confidence score.                                | `True`        |
--- a/docs/modes/track.md
+++ b/docs/modes/track.md
@ -21,9 +21,9 @@ The output from Ultralytics trackers is consistent with standard object detectio

 <p align="center">
  <br>
-  <iframe width="720" height="405" src="https://www.youtube.com/embed/hHyHmOtmEgs?si=VNZtXmm45Nb9s-N-" 
-    title="YouTube video player" frameborder="0" 
-    allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" 
+  <iframe width="720" height="405" src="https://www.youtube.com/embed/hHyHmOtmEgs?si=VNZtXmm45Nb9s-N-"
+    title="YouTube video player" frameborder="0"
+    allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share"
    allowfullscreen>
  </iframe>
  <br>
@ -86,7 +86,7 @@ To run the tracker on video streams, use a trained Detect, Segment or Pose model
        yolo track model=path/to/best.pt source="https://youtu.be/LNwODJXcvt4"  # Custom trained model

        # Track using ByteTrack tracker
-        yolo track model=path/to/best.pt tracker="bytetrack.yaml" 
+        yolo track model=path/to/best.pt tracker="bytetrack.yaml"
        ```

 As can be seen in the above usage, tracking is available for all Detect, Segment and Pose models run on videos or streaming sources.
@ -199,38 +199,38 @@ In the following example, we demonstrate how to utilize YOLOv8's tracking capabi

    ```python
    from collections import defaultdict
-    
+
    import cv2
    import numpy as np
-    
+
    from ultralytics import YOLO
-    
+
    # Load the YOLOv8 model
    model = YOLO('yolov8n.pt')
-    
+
    # Open the video file
    video_path = "path/to/video.mp4"
    cap = cv2.VideoCapture(video_path)
-    
+
    # Store the track history
    track_history = defaultdict(lambda: [])
-    
+
    # Loop through the video frames
    while cap.isOpened():
        # Read a frame from the video
        success, frame = cap.read()
-    
+
        if success:
            # Run YOLOv8 tracking on the frame, persisting tracks between frames
            results = model.track(frame, persist=True)
-    
+
            # Get the boxes and track IDs
            boxes = results[0].boxes.xywh.cpu()
            track_ids = results[0].boxes.id.int().cpu().tolist()
-    
+
            # Visualize the results on the frame
            annotated_frame = results[0].plot()
-    
+
            # Plot the tracks
            for box, track_id in zip(boxes, track_ids):
                x, y, w, h = box
@ -238,21 +238,21 @@ In the following example, we demonstrate how to utilize YOLOv8's tracking capabi
                track.append((float(x), float(y)))  # x, y center point
                if len(track) > 30:  # retain 90 tracks for 90 frames
                    track.pop(0)
-    
+
                # Draw the tracking lines
                points = np.hstack(track).astype(np.int32).reshape((-1, 1, 2))
                cv2.polylines(annotated_frame, [points], isClosed=False, color=(230, 230, 230), thickness=10)
-    
+
            # Display the annotated frame
            cv2.imshow("YOLOv8 Tracking", annotated_frame)
-    
+
            # Break the loop if 'q' is pressed
            if cv2.waitKey(1) & 0xFF == ord("q"):
                break
        else:
            # Break the loop if the end of the video is reached
            break
-    
+
    # Release the video capture object and close the display window
    cap.release()
    cv2.destroyAllWindows()
@ -283,36 +283,36 @@ Finally, after all threads have completed their task, the windows displaying the
    def run_tracker_in_thread(filename, model, file_index):
        """
        Runs a video file or webcam stream concurrently with the YOLOv8 model using threading.
-    
+
        This function captures video frames from a given file or camera source and utilizes the YOLOv8 model for object
        tracking. The function runs in its own thread for concurrent processing.
-    
+
        Args:
            filename (str): The path to the video file or the identifier for the webcam/external camera source.
            model (obj): The YOLOv8 model object.
            file_index (int): An index to uniquely identify the file being processed, used for display purposes.
-    
+
        Note:
            Press 'q' to quit the video display window.
        """
        video = cv2.VideoCapture(filename)  # Read the video file
-    
+
        while True:
            ret, frame = video.read()  # Read the video frames
-    
+
            # Exit the loop if no more frames in either video
            if not ret:
                break
-    
+
            # Track objects in frames if available
            results = model.track(frame, persist=True)
            res_plotted = results[0].plot()
            cv2.imshow(f"Tracking_Stream_{file_index}", res_plotted)
-    
+
            key = cv2.waitKey(1)
            if key == ord('q'):
                break
-    
+
        # Release video sources
        video.release()

--- a/docs/modes/train.md
+++ b/docs/modes/train.md
@ -14,9 +14,9 @@ Training a deep learning model involves feeding it data and adjusting its parame

 <p align="center">
  <br>
-  <iframe width="720" height="405" src="https://www.youtube.com/embed/LNwODJXcvt4?si=7n1UvGRLSd9p5wKs" 
-    title="YouTube video player" frameborder="0" 
-    allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" 
+  <iframe width="720" height="405" src="https://www.youtube.com/embed/LNwODJXcvt4?si=7n1UvGRLSd9p5wKs"
+    title="YouTube video player" frameborder="0"
+    allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share"
    allowfullscreen>
  </iframe>
  <br>
@ -240,7 +240,7 @@ To use Comet:
        ```python
        # pip install comet_ml
        import comet_ml
-        
+
        comet_ml.init()
        ```

@ -258,7 +258,7 @@ To use ClearML:
        ```python
        # pip install clearml
        import clearml
-        
+
        clearml.browser_login()
        ```