Skip to content

Commit

Permalink
nit: rename falcon
Browse files Browse the repository at this point in the history
  • Loading branch information
ishaansehgal99 committed Oct 5, 2023
1 parent b9dc72c commit 5280fd8
Show file tree
Hide file tree
Showing 2 changed files with 38 additions and 2 deletions.
4 changes: 2 additions & 2 deletions pkg/presets/falcon/inference-api.py
Original file line number Diff line number Diff line change
Expand Up @@ -14,10 +14,10 @@
import torch
import torch.distributed as dist

parser = argparse.ArgumentParser(description='HuggingFace Model Configuration')
parser = argparse.ArgumentParser(description='Falcon Model Configuration')
parser.add_argument('--load_in_8bit', default=False, action='store_true', help='Load model in 8-bit mode')
parser.add_argument('--disable_trust_remote_code', default=False, action='store_true', help='Disable trusting remote code when loading the model')
parser.add_argument('--model_id', required=True, type=str, help='The HuggingFace ID for the pre-trained model')
parser.add_argument('--model_id', required=True, type=str, help='The Falcon ID for the pre-trained model')
args = parser.parse_args()

app = FastAPI()
Expand Down
36 changes: 36 additions & 0 deletions pkg/presets/llama-webserver/pod.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
# deployment.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
name: file-download-app
spec:
replicas: 1
selector:
matchLabels:
app: file-download-app
template:
metadata:
labels:
app: file-download-app
spec:
containers:
- name: my-container
image: smritidahal/file-download-app:latest
ports:
- containerPort: 80
---
# service.yaml
apiVersion: v1
kind: Service
metadata:
name: file-download-service
spec:
selector:
app: file-download-app
ports:
- protocol: TCP
port: 80
targetPort: 5000
type: LoadBalancer


0 comments on commit 5280fd8

Please sign in to comment.